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This book has been out of print for many years now, but the 
demand for it continues. Electrodynamics and the classical theory 
of fields remain very much alive and continue to be the source 
of inspiration for much of the modern research work in new physical 
theories. The treatment of the foundations of electrodynamics in 
this book is still up-to-date. I welcome, therefore, this new Dover 
edition. 

A number of misprints and mistakes in the original 1964 edition 
have been corrected. I am grateful to many colleagues and students, 
especially James T. Cushing, Guillermo Garcia Alcaine and W. C. 
Bolling, for pointing errors out to me. 


A. O. B. 
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ore TO THE 
FIRST EDITION 


This book is primarily intended as an introduction on the 
classical level to quantized Lorentz invariant field theories. Much 
of the subject matter treated in this book has gained interest and 
been developed only after the advance of relativistic quantum 
theories. In spite of the shortcomings of the quantum field theory, 
the study of field theory and the particle-field and particle-particle 
action-at-a-distance interactions will be part of the education of 
theoretical physicists, guiding them in the foreseeable future. The 
need was therefore felt for a modern description of classical field 
theories, especially those aspects of quantum theories which one 
can already study on the classical level. Electrodynamics plays the 
central role throughout the book as the prototype of interacting 
fields and particles. The gravitational field is not included, but it 
is hoped that this book may also serve as an introduction to generally 
covariant field theories. 

I should like to thank my colleagues W. E. Brittin, J. Currin, 
and D. E. Zwanziger for their suggestions after reading pertinent 
parts of the manuscript. 
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The two basic and most fundamental concepts of classical 
“atomistic” physical theories—in contrast to phenomenological 
theories like thermodynamics—are that of particle and field. 
Broadly speaking, this book deals with the interactions of systems 
consisting of particles and fields and, in particular as a prototype, 
with the interactions of charged particles with the electromagnetic 
field. These two concepts or entities are quite distinct in classical 
theories and they are described by very different mathematical 
apparatus: on the one hand, space-time points as the positions of the 
particles; on the other hand, functions of space-time as the field 
intensity or the potential. Only in relativistic quantum theories do 
we atrive at a synthesis of these two concepts, namely to attempt 
to describe everything in terms of fields alone, or in terms of par- 
ticles alone. We cite the quantum field theory and the more recent 
S-matrix theory for these two directions. Even in classical theories 
one can find similar directions: the action-at-a-distance electro- 
dynamics treats the interactions of particles without fields; and one 
can also represent in a limited sense the particles by classical fields. 
The process of svnthesis is still unfinished and incomplete, even in 
the domain of quantum theories. In the main, particles and fields 
in this book have been treated distinctly. Many of the physical 
and mathematical ideas and methods of relativistic classical theories 
are also basic to relativistic quantum theories. 

The treatment throughout the book is relativistic; we have 
attempted, as much as possible, a coordinate free (or covariant) 
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form of the equations both for particles and the fields. The special 
theory of relativity is an outcome of Maxwell’s equations and the 
relativistic space-time is the natural framework to use in electro- 
dynamics and in classical field theories. For this reason we have 
attempted in Chapter I, Part I, to give a fairly complete discussion 
of space-time, Lorentz transformations, Lorentz-group and tensor 
and spinor fields. A fluent knowledge of the material of this chapter 
is essential, not only for this book but for many other branches of 
theoretical physics, in particular for relativistic quantum theory. 
However, some sections, such as the representations of the Lorentz- 
group, may be omitted at first reading. 

The fundamental problem of dynamics is to describe the tra- 
jectories of particles for given external forces. Various relativistic 
forms of this problem are treated in Chapter II. In field theory 
the basic entities are a set of functions of space and time which 
describe a field. For example, the six functions B(x), E(x) describe 
the Maxwell field. These functions are not arbitrary; the class of 
admissible functions is defined by the solutions of the so-called 
field equations. The general dynamical principles to obtain the 
field equations and the important problem of the conservation 
laws are discussed in Chapter III. In electrodynamics we take 
mainly the microscopic point of view of Lorentz which deals with 
fields and charges in vacuum. The Maxwell equations in matter, 
i.e., in metals or dielectrics, are of phenomenological nature using 
average quantities, and are discussed only briefly in this chapter. 

To the initial value problem of dynamics there corresponds in 
field theory the so-called Cauchy problem. Given the field quantities 
and their derivatives everywhere at a given time, the problem is to 
determine the field at a later time, In particular, it is of importance 
to find the solutions of the field equations in the presence of inter- 
actions. The interactions of fields with the particles or, more 
generally, with other fields, are expressed by the source terms in 
inhomogeneous field equations. The problem is then to find the 
resultant field at a space-time point, given a distribution of sources, 
and the “initial fields” again everywhere but at a given time. (Chap- 
ter IV.) 

As the particles influence the fields by producing their own 
field as a result of their motion, the field itself, on the other hand, 
influences the motion of these sources. The first is observed as 
radiation and the second as radiation reaction. This feedback 
between the particles and the fields is discussed in Chapter V. Here 
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also we come to questions concerning the limitations of the classical 
field theories and classical dynamics. For, irrespective of the prob- 
lem of complete applicability of the theory to nature—quantum 
effects, of course, will come in before these limitations become 
important—the question arises whether the classical theory of inter- 
acting fields and particles is a consistent theory. The classical non- 
relativistic mechanics is an example of a consistent theory; however, 
here the radiation and radiation reaction effects are completely 
neglected. With the inclusion of these effects, there are serious 
problems concerning the solutions of the equations of motion and 
the infinities occurring in the theory resulting from the mass-point 
character of the particles. The possible solutions of these problems 
are discussed. 

Finally, in Chapter VI, we discuss a purely mechanical approach 
to the problem of interactions of charged particles, namely the so- 
called action-at-a-distance formulation of electrodynamics, as in the 
ordinary mechanics. Here the field concept is eliminated, and the 
equation of motion of each particle is written down taking into 
account “forces” resulting from all other particles. 

This is then an outline and survey of the book. We have con- 
centrated on the development of the general theory of the subject. 
A number of interesting applications and related subjects have been 
included as exercises or further reading, with brief hints and refer- 
ences. For the background. material in electrodynamics, the recent 
book of J. D. Jackson, Classical Electrodynamics (New York: 
John Wiley & Sons, Inc., 1962) is recommended. For the material 
in this book, a more or less complete list of references is given. We 
remark finally that the problems of radiation and radiation 
reaction, and the resultant equations of motion have recently again 
attracted the attention of many physicists. 

To keep the material continuous, alternate proofs and additional 
topics have been inserted in the sections “Further Developments 
and Exercises” at the end of each chapter. Some applications in 
these sections as well as some intermediary steps have been left 
to the reader as an exercise. The student is urged to derive these 
steps and do the problems, which we believe to be of great pedago- 
gical value. 
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PART I 


Relativistic Description 
of Fields and Particles 
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« 


re 2 ee 
q cae aa 
& 
Leyte 


“a ORENTZ 


TRANSFORMATIONS 


1. The Physical Basis of the Lorentz Transformations} 


The physical laws governing the behavior of fields and particles are 
expressed in terms of space-time coordinates x, t and the functions of the 
coordinates.” On the other hand, the physical laws describe permanencies 
in nature that are independent of any coordinate frame: a physical process 
will take place irrespective of what coordinate frame the observer may 
choose from which to observe it. The arbitrariness resulting from the choice 
of the coordinate frames must be eliminated from the formulation of the 
laws. Consequently, the transformations between different possible frames 
of reference and the quantities which are invariant under these transfor- 
mations assume a fundamental importance. The physical laws will be 
written in such a way that their content and form is the same for a class 
of observers ; for an objectively identical situation only the numerical values 
of observed quantities may change from observer to observer. 

Frames of reference in which free particles move uniformly are called 
inertial frames of reference. This concept, formally, is the same both in 
classical mechanics and in the special theory of relativity.* 


1A partly deductive description of the Special Theory of Relativity is attempted 
here. For a historical development and references, see W. Pauli, Theory of Relativity 
(New York: Pergamon Press, 1958), Secs. 1 and 2. 

2 In contrast to this, the quantum-mechanical S-matrix theory attempts to formulate 
the laws of elementary particles, without space-time coordinates, in terms of directly 
observable quantities such as scattering amplitudes, and without the direct use of 
coordinates. Of course, Lorentz transformations are still used indirectly in momentum 
space. 

3 In the general theory of relativity even the inertial frames of reference are not 
singled out. These are eliminated by the use of a more general class of frames of reference, 


W. Pauli, foc. cit. 
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In classical mechanics the correspondence between different inertial 
frames is expressed by the Galilean transformation law: two coordinate 
frames moving with respect to each other with the constant relative velo- 
city w are related by 


x =Oxtw t'=14, (1.1) 


where O is an arbitrary orthogonal transformation and expresses the 
relative position in space of the two coordinate frames at t=0. The 
transformations, Eq. 1.1, which go hand in hand with Newton’s laws, 
define the class of classical inertial frames. These frames are characterized 
therefore by the totality of the set (O, w) with w any vector, O any ortho- 
gonal transformation. Newton’s laws are the same for all such inertial 
frames although the numerical values, say, of the coordinates of the 
particles are different. 

Other or more general physical laws (or theories) may not be valid 
for the class of coordinate frames defined by Eq. 1.1. A difficulty was 
created for physics when an attempt was made to test the validity of light 
phenomena for all Galilean inertial frames defined by Eq. 1.1. In the case 
of the propagation of the electromagnetic waves it has been established 
experimentally that the velocity c of the propagation is constant and the 
same for all inertial frames, contrary to the Galilean rule of the addition 
of velocities as derived from Eq. 1.1. On the theoretical side, the phenomena 
of light propagation are correctly described by Maxwell’s equations 
which are not valid for all Galilean inertial frames, or mathematically 
speaking, which are not invariant under the transformations Eq. 1.1. 

This difficulty was solved by the special theory of relativity. The 
Lorentz transformations developed below will define a new class of inertial 
frames for electromagnetic phenomena and, because it is believed that the 
relations of physical phenomena to observers is the same for all physics, 
for all other phenomena until we are met with a new difficulty. 

The study of the transformations of the type of Eq. 1.1 adds a new 
dimension to physical theories. On the one hand, one can study the 
allowed transformations for a given theory; on the other hand, one can 
try to construct new theories which will allow a given set of transfor- 
mations. All theories are associated with a set of transformation laws that 
define the class of observers for which the theory is valid. Relativistic 
theories of mechanics, hydrodynamics, thermodynamics, statistical 


: 1 Recently new experiments have been performed with masers, which are about ten 
times more accurate than the early Michelson-Morley interferometer experiments. 
(i P. Cederholm, G. F. Bland, B. L. Havens, and C. H. Townes, Phys. Rev. Letters, 1, 

1958), 342. 
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mechanics, and so on, are all generalizations of the corresponding non- 
relativistic theories with Eq. 1.1 replaced by the Lorentz transformations. 

After this background we go on to the development of Lorentz trans- 
formations. Because the transformation laws, Eq. 1.1, are empirically 
valid in the low-velocity domain, the Lorentz transformations we are 
seeking must reduce to the form Eq. 1.1 in the limit of small velocities. 
To find the proper generalization, let us consider the example of light 
propagation in detail. 

For a light signal propagating as a spherical wave the equation of the 
wave front is 


71? — x? — x3 — x3 =0. 


The constancy of the velocity of light c implies now that in any other 
inertial frame to be defined, which at t = 0 coincided with the first, the 
equation of the wave front is 


c7t'? — x}? — x? — x? =0, 


i.e. again a spherical wave with the same velocity c. This is only possible 
if t' #1, because x; #x;. Thus, the transformation law (x’, t’)<(x, 2) 
must be such that 


x2—r?=0 implies xi? —r'2 =0, and vice versa, (1.2) 


where we have put x, = cf. Furthermore, the transformations must have 
an inverse because we can interchange the roles of the coordinates (x, f) 
and (x’, t’) and map the origin into the origin, as we have assumed the 
two frames to be coinciding at t = 0. Consider the values of x) and r such 
that x2 — r? =a ¥ 0(i.e., points not on the light front). The quantity @ has 
the dimension of (length)?. For dimensional reasons x’? — r’? must be, 
therefore, a linear function of a, and by previous requirements actually 
equal to a.’ The transformations which are uniquely determined by the 
single relation 


x— re =xP—r*=a,: areal, (1.3) 


are then the Lorentz transformations. We shall see that they will go 
over to Eq. 1.1 in the limit of small velocities. Eq. 1.3 also defines the 
set of inertial frames for electrodynamics and all special relativistic 
theories. 
From an experimental point of view the constancy of the velocity of 
light is not enough to obtain Eq. 1.3. Two other experiments must be 
1 Sometimes the requirement that the transformation is linear is also introduced 


but this has no immediate physical meaning except that ‘in the small” we may consider 
the transformations to be (approximately) linear. The linearity will follow from Eq. 1.3. 
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invoked to assert this, even for linear transformations. These are the 
Kennedy-Thorndike experiment? showing the independence of the transit 
time of light on any closed path on the velocity of the earth, and the Ives- 
Stilwell? experiment on the second-order Doppler shift, to be treated 
later. 

One can also start, as did Lorentz and Poincaré, from Maxwell’s 
electrodynamics and see under what transformations these equations are 
invariant. One again arrives at Eq. 1.3. In any case, whether derived 
inductively or deductively, we take Eq. 1.3 as the defining equation of 
inertial frames. Correspondingly we define all transformations that leave 
the expression 

x? = x2 — x? — x2 — x? 
invariant as homogeneous Lorentz transformations. More generally the 
inhomogeneous Lorentz transformations are defined to be those which 
leave the expression 


(x = y)? = (Xo — yo)? — (%1 — 91)? — 2 — 2)? — %a— Ys)? 
invariant. These differ from the homogeneous transformations by a trans- 
Jation of space and time coordinates. The special theory of relativity has 
been introduced here in coordinate space, as was the case historically. 


One can equally well introduce it in momentum space starting from the 


frequency and propagation vector of light waves (see Sec. 4.C of this 
chapter). 

A thorough knowledge of the properties of Lorentz transformations 
and their representations is necessary for the following chapters. The next 
sections are devoted mainly to these mathematical questions. 


2. Mathematical Properties of the Lorentz Space 


Notations 


, We introduce a 4-dimensional vector space of four-vectors x with 
components x°, x’, x?, x°, and define a real norm (or metric), 


x= =x Se (1.4) 
and a scalar product of two four-vectors x and y, 
xy = x°y® — xly! — x2y? — x33, (1.5) 


1H. P. Robertson, Rev. Mod. Phys., 21, (1949), 378. 
2 R. J. Kennedy and E. M. Thorndike, Phys. Rev., 42, (1932), 400. 
* H. E. Ives and C. R. Stilwell, J. Opt. Soc. Amer., 28, (1938), 215; 31, (1941), 369. 
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The vectors transformed under a Lorentz transformation L will be denoted 
by x’ = Lx, y’ = Ly, --- . From the postulated invariance of the norm; 
i.e., (Lx)? = x?, we can deduce the invariance of the scalar product (1.5) 
under Lorentz transformations. For this purpose consider the vector 
x + y. Because 


[Lx + y)]? = + y)’, 
we obtain, by squaring both sides, 
Nae x yi aoys 

There is a one-to-one correspondence between the vectors of the four- 
dimensional Euclidian space R* and this Lorentz space L*.! 

Every four-vector x will have a positive definite Euclidian norm 
(x, x) > x°? + x!? + x2? + x*? in R* and an indefinite norm, Eq. 1.4, in L*. 
Eq. 1.4 can also be written within the framework of a positive definite 
metric as 


Gk): xy = (x, Gy), (1.4’) 
where G is the 4x 4 matrix 
1 0 0 0 
0 -1 0 0 
as (1.6) 
Samo 1 0) 
0 -.0 0 -1 


with the matrix elements go99 = —9, =1, 9,, =O if ux v.2 Thus we shall 
use, equivalently, 


xy = (x, Gy) = Dag" 9" — gx Vy a ee (1.5’) 


where the so-called covariant components x, of the coordinates are given 
by 
| “3 3 
Xu = Guy's ng — Xs x Ve 
with g""g,, = g4 = 6%. The contravariant components of x are x". 


These various notations for the norm and the scalar product are 
convenient and common. We must observe throughout the distinction 
between the covariant and the contravariant indices. 


1To distinguish from the Minkowski space (x!, x?, x3, x4 = ict), we denote the 
space with the indefinite norm (4) as the Lorentz space. To avoid the use of complex 
numbers, we shall use throughout the metric (1.4.) 

2 We shall use Greek indices p, v, °-* 9, p, -*: to run from 0 to 3 and Latin indices 
i,j,k, °++ to run from 1 to 3, and the summation convention for both types of indices. 
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Vectors and Scalar Products 


We divide the vectors x according to their norms into three groups: 
x?>0, time-like vectors, 
x? =0, light-like or null vectors, 
x? <0, space-like vectors. 


All light-like points form the surface of a three-dimensional cone in the 
four-dimensional space, called the light cone. The light cone can be 


Forward light-cone 


Time—like 


Light-like 
Space-like 


Backward light-cone 


Fig. |. Lorentz Space and three kinds of four vectors 


visualized as follows: a spherical light wave diminishing down to the 
origin (backward cone) and then expanding to infinity (forward cone), 
all, of course, with the velocity c. Vectors with x? = a = constant lie on 
three-dimensional hyperboloids. The two-dimensional picture in Fig. 1 
schematically illustrates these groups of vectors. 

All time-like vectors are inside the light cone and the space-like ones 
are outside. The slope of the line connecting any two points indicates the 
velocity of the signals between the points. Thus space-like points cannot 
be reached from each other by signals with velocities equal or less than c. 
This is the so-called causality requirement which plays an important role 
in relativistic theories. 
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The invariant scalar product xy = (x, Gy) = x,y" may be completely 
defined by the following two properties: 
xy — yx. (ax + By)z = axz + Byz; a, B real numbers. (1.7) 
We then obtain by simple algebra the following identities: 
(x + y)? = x? + y? + 2xy, 
(x + y)? + — y)? = 202? + y), (1.8) 
(x + y)\(x—- y) =x? -y’. 

Two four-vectors x and y are called orthogonal if xy = 0. Thus every 
light-like vector is orthogonal to itself. It follows from the above identities 
that the sum and the difference of any two light-like vectors are orthogonal 
and have equal and opposite norms. Two orthogonal light-like vectors are 
at most multiples of each other, for from xy = 0 and x? = y? =0 it follows 
that x-y = (x*)!/?(y?)'/?; hence the spatial parts, and therefore the vectors, 
are proportional to each other. A simple way of constructing arbitrary 
orthogonal vectors is to take any two vectors of equal norm and form the 
sum and the difference of them. 

We shall now prove some more useful relationships between the 
vectors. 

(1) All vectors x orthogonal to a time-like vector z are space-like and 
form a three-dimensional space-like subspace, for from x°z° = x-z, we 
get 

|x°z°| [xz] [xez| 
Be ie a’ 


because z is time-like, or, squaring both sides, 


2 
X°Z 
o | | 


cS can 
Es 


hence x is space-like. By the same argument, every vector orthogonal to 
a light-like vector is space-like, except the previously mentioned multiple 
of the light-like vector itself. There are only two orthogonal space-like 
vectors, both orthogonal to the light-like vector. Obviously there are 
three mutually orthogonal space-like vectors. This completes the ortho- 
gonality properties of the four-vectors. 

(2) Analogous to the decomposition of ordinary vectors into ortho- 
gonal components, we can represent every vector x in the form 


x=y+ Tz, (1.9) 


where z is a fixed time-like vector and y a unique space-like vector 
orthogonal to z determined by x, and 1 is a real parameter. t and y are 
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then the components of x with respect to z. If z:(1, z) and y: (0, y), then t 
is the time component of x. t can be considered as an invariant time 
coordinate because the decomposition, Eq. 1.9, is invariant under Lorentz 
transformations. Similarly y will be the spatial, three-dimensional part 
of the four-vector z in invariant form. 


yo=x-tZ=x—2(xz). (1.9’) 


(3) The sum of two time-like vectors z,, z, is again time-like if both 
vectors are on the same half of the light cone (z?, z? have the same sign), 
otherwise it is time-like or space-like, depending on whether (z? + z$)? 
> (z, +22), or < (zi + 23), or light-like. 

(4) The invariant Schwarz inequality holds for two time-like vectors 
in the inverse form; i.e., 

Gigs)? ice (1.10) 


equality holds if and only if z, and z, are parallel. To show this we can 
assume without loss of generality z, to be (1000). Then z? 
which proves the statement. 


For one space-like vector y and one time-like vector z we have 
(y27* Sy" (1.11) 
Schwartz’s inequality does not give anything new if one or both of the 
vectors are light-like. For two space-like vectors y,, y, the situation is a 
little complicated. Taking y,:(0, 1, 0, 0), we see that for y?* + y?” < y%” 
< yl’ + y?? + y?’ we have the inequality 


a ae 1 
22; —2; 


(y1¥2)? = yiy3; (1.12) 
otherwise, the usual inequality? 
(yi¥2)" < yiy3. (1.12’) 


(5) The form of the metric, Eq. 1.4 shows that the time and the space 
coordinates are on a different footing. The special theory of relativity 
says only that any time-like vector is physically as good as any other as 
the time coordinate, if no time reversal is involved. Therefore, we call 
two time-like vectors equivalent if z,z, > 0. This means that z, and z, are 
both in the forward or in the backward light cone. This equivalence 
relation is reflexive, commutative and transitive. To prove the transitivity ; 
i.e., if z1z2 >0 and z,z, >0, then z,z, >0, we can take Z2:(I, 0, 0, 0). 
Then z? > 0 and z$ > 0 implies z,z, > 0. Note that the sign of the scalar 
product is determined by the sign of their zero components. 

1 W. Graeub, Lineare Algebra (Berlin: Springer Verlag, 1958), p. 167, always assumes 


Equlri2: However, one can give counterexamples that hold for (12): For example, 
(0,1,0,0) and (/3/2,1,1,0) satisfy Eq. 1.12, but (0,1,0,0) and (1,1,1,1) satisfy Eq. 1.12’. 
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Basis 


In writing the norm and the scalar products in Eqs. 1.4 and 1.5, we 
have used the components of the vectors x, y with respect to the four 
basis vectors (1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0) and (0, 0, 0, 1). This is not 
necessary. The Lorentz space can be characterized directly by the pro- 
perties of the scalar product given by Eq. 1.7 and the nonpositive definite- 
ness of the norm. Moreover, any four linearly independent four-vectors 
can be used as a basis. In particular, we introduce an orthogonal basis, a 
set of 4 four-vectors e“ (A = 0, 1, 2, 3) such that any vector x can be 
written as 

x = ce), (1.13) 


It follows from our previous considerations that one of the e* has to be 
time-like and the other three are then necessarily space-like. Light-like 
vectors cannot be used. Two or three orthogonal space-like vectors span a 
two- or three-dimensional space-like manifold; i.e., every vector on the 
manifold is space-like. Time-like and light-like manifolds are one- 
dimensional. Thus, in order to represent every vector in the form of 
Eq. 1.13, e™ must satisfy’ 


everte agi, (1.14) 
Consequently, the contravariant components of x in an arbitrary ortho- 
normal basis e™ are given by 
= xe), (1.15) 
In the decomposition, Eq. 1.9 we can take, for example, z to be the time- 
like leg of the basis. 


Surfaces and differential properties in Lorentz space will be discussed 
in Sec. 8. 


*Complex Lorentz Space? 


The complex Lorentz space has found applications in relativistic 
quantum theories. We add to every real four-vector x an imaginary four- 
vector iy and write 

E=x+iy. 
Two types of generalization of Lorentz space can now be introduced. One 


1 Such a basis is denoted in the literature sometimes as “‘Vierbein” or “four-leg”’. 
2 Sections or chapters with * can be omitted at first reading without any effect in 
the reading of the following chapters. 
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is the ordinary complex vector space with the real but nonpositive definite 
norm, where the scalar product is characterized by the equations 


én= ne; 
E(an, + Bn2) = Hn, + Ben,; ~* 4, B complex numbers ; 
(aE, + BE2)n = a€4n + Boon, 
and gives for the norm ¢? = x? + y’. 
In the other generalization the scalar product does not involve any 
complex conjugation; i.e., 
En=n&; E(an)=atn; + (acy = adn, 


and the norm is complex: €? = x? — y? + 2ixy. 
Both spaces reduce in the limit y = 0 to Lorentz space. 


3. Properties of the Lorentz Transformations 


The homogeneous Lorentz transformations leave the scalar product 
_xy invariant; ice., 


(Lx, GLy) = (x, Gy) = xy; (1.16) 
they are thus completely characterized by the matrix equation’ 
EGL =4G. (1.17) 


For real transformations, which we are considering, L* = 7. In the 
metric, Eq. 1.6 all the matrix elements of L are real.” 
From Eq. 1.17 we find that 


det L = +1. (1.18) 


Because the identity transformation has determinant +1, all Lorentz 
transformations generated continuously from the identity will have deter- 
minant +1 and are called proper homogeneous Lorentz transformations L,. 
Transformations including reflections may have determinant —1 and will 
be denoted as improper Lorentz transformations L_. In the notation 

1 4+, AT, A* will denote respectively the hermitian conjugate, the transpose and the 
~ complex conjugate of the matrix A. 


® One can also define an adjoint L to L with respect to our indefinite scalar product 
xy by 
(Lx)y = x(Ly). 


Then the Lorentz transformation are “unitary” in the sense that LL = I, where 
L=G "LG. 
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xt" = [Xx” the matrix elements of L satisfy the following relations: 
3 
ie i Y Ls =1, 
5 k=1 
LY - yyy =-1, k=1,2,3, (1.19) 
m=0 


oye: y — 0, U # V, 

which follows from Eq. 1.17. Thus, the elements in the first column of L 
can be considered as components of a unit time-like vector [first line of 
Eq, 1.19], and those of the three other columns as components of unit 
space-like vectors [second line of Eq. 1.19]. These 4 four-vectors are 
mutually orthogonal [third line of Eq. 1.19] and form a basis, namely, 
that obtained from (1000), (0100), (0010), and (0001) by the application 
of L. 

We divide the Lorentz transformations, according to the sign of L°,, 
into two groups: orthochronous transformations L* with L°, = 1,-and 
antichronous transformations L, with L°, < —1. Notethat |L°o| = 1 always. 
Under L* the sign of the zero component of time-like vectors are un- 
changed. This is clear if the time-like vector x is (1000). In the general 
case we have 


ane? s (51%) (52°) -e3-o Ee < 18 < LR". 
k k k 
where use is made of Eq. 1.19 and the fact that x is time-like. Hence 
[Lo x°| > |L°,x*]. 
In other words, Lt maps one time direction into an equivalent time 
direction; it operates within an equivalence class of time-like vectors. On 


the other hand, L, changes the sign of x° and connects the two equivalent 
classes. 


Inhomogeneous Lorentz transformations. These transformations can be 
written in the form 
x’ =Lx+a= 2x (1.20) 
where a is an arbitrary four-vector. In this case, not x? but the relative 
norm (x — y)? is invariant and the origin is mapped into a. Because the 
inhomogeneous Lorentz transformations include the translations, they 
are physically more important. Any # is obtained as the product of a 
translation T and a proper Lorentz transformation L. We write sym- 


bolically 
# =(a; 1). (1.21) 
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The identity transformation is (0, J). For the product of two inhomo- 
geneous transformations (a,, L,) and (a2, L); i.e., 


x" = L,x' + a, =L,(L,x + a,) + 4a, 
we get the rule 
(az, L,)(a,, Ly) = (a2 + L24;, L2L1) = (a, L), (1.22) 


which can be written in the matrix form 


1 oO om @ 1 0 
= . 122" 
a a i? bs bom + L3a, _) ( ) 


Thus, the inverse of (a, L) is (—L~ ‘a, L~’). 


Eigenvalues of the homogeneous Lorentz transformations. These are 
defined by the equation 


Ei dy, (1.23) 


or by the roots of the equation det (L — 27) = 0. Let v, and v, be any two 
-eigenvectors with eigenvalues A,, A. Then 


(Lv,)(Lv2) SU SS AAV 102 q v? = 4707. 


We have the following possibilities: (a) If v is not light-like, then 4? = 1 
or A= +1. (b) If both v, and v, are not light-like and 2, 4A, then r, 
and v, are orthogonal. This is also true if v, and/or v, are light-like, but 
with A, # A,, and if both are different from zero. (c) If A, is an eigenvalue, 
so is 1/A,. The eigenvectors of complex Lorentz transformations can be 
discussed in a similar fashion.’ (See Exercise 2.) 


Decomposition of homogeneous Lorentz transformations into plane 
reflections. We define a special Lorentz transformation }’, by 


Yxax'=x—-2—Sn, n? £0, (1.24) 


for all x. n is a four-vector, and }’, represents a reflection of the vector x 
through a hyperplane orthogonal to n. For example, if n:(1000), then 
x° = —x°, x’=x; hence ), in this case is the time-reflection trans- 


formation. )', will be called a plane reflection or a symmetry. We derive 


LE. P. Wigner, Annals of Mathem., 40, (1939), 149, p. 160. 
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the following properties of )’, from its definition, which justify this nota- 
tion and which are self-explanatory: 


OS yas 


(b) },n=—n. 
(c) )), x =x, if (nx) = 0, i.e. if x is “orthogonal” to n. 
=i. 


Furthermore, for two vectors n and n’ = Ln, the corresponding symmetries 
are connected by a similarity transformation with L; i.e., 


Lk a, n’ = Ln. (1.25) 
To prove Eq. 1.25, we form 


; -1 
LUE tea LYy=1(y—2 Pn) = Ly 2 tne" n 
n n n 


(n'x) , 
axe 2 52 n= x—2 a ax: 
It follows from Eq. 1.25 that det ), = det }',, and tr), =tr),. By a 
suitable L we can always bring a time-like n into the direction n’: (1000). 


Hence we see that 
dty=-1, try =2. (1.26) 


Therefore, if n is time-like, )', belongs to the L? class of Lorentz trans- 
formations; similarly, if n is space-like, ), belongs to Lt. Reflection 
throught a light-like vector is not defined because there is no hyperplane 
orthogonal to a light-like vector. 

The matrix elements of Ye are given by 


(= 8-2 


In particular, we introduce four reflections }'9, )1, )2, 3 of the four 
coordinate axes, respectively; i.e., for m: (1000), (0100), (0010), and (0001). 
The product )",, )’, reflects two coordinates (u # v) and Y', ), Ye = —Dos 
L, v, 6, p all different. 

In terms of these symmetries we have the following decomposition 
theorem:' Any homogeneous Lorentz transformation L can be written 
as a product of 4 symmetries at most. 


n 


ony : 
as (1.27) 


1L,. Michel, Nuovo Cimento, S14, (1959), 95. This is a special case of a more general 
theorem: Linear transformations in an n-dimensional space leaving the nondegenerate 
metric gijx‘x/ invariant can be written as a product of m symmetries at most. 
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Proof: 

(a) If there exists an invariant vector u, i.e., Lu =u, u” # 0, then every 
vector y orthogonal to u remains orthogonal after the transformation; 
j.e., yu = 0 implies (Ly)u = 0. Hence L transforms the three-dimensional 
subspace orthogonal to u into itself and, by induction, L is the product 
of three plane reflections at most. One can continue this procedure down 
to one dimension. If at any stage there is no fixed vector x, one proceeds 
as follows. 

(b) There exists no invariant u, but a vector y such that Ly —y =a is 
not light-like. Then the product transformation ()'.L) has a fixed vector, 
namely the vector y itself, for 


aLy (Ly — y)Ly 
wLy = Ly -2—- a=Ly - 2, (ly - 
Lely = Ly - 2-5 y (ly, ) 
“hy —)) ay aa, 
(Ly — y) 


Hence YL is the product of three plane reflections at most, or L is, in 
this case, the product of four plane reflections at most. 

(c) There remains only the case where no such y exists; ie., Ly — y 
is light-like for all y. In this case L = J, which is the product of two 
reflections. 

Another decomposition of the homogeneous Lorentz transformations 
into rotations and special proper transformations will be discussed in the 
next section. 


4. Special Lorentz Transformations. Applications 


The explicit form of the following particular Lorentz transformations 
will be useful in actual applications. 

(a) Three-dimensional orthogonal transformations’ (L3=1, Ly = 
L* = 0) belong to the class L*, the rotations to L‘,, and the reflections to Lt. 
The space-reflection transformation will be denoted by Lp = —Yo- 

(b) Time reflection Ly =) Lt. 

(c) Space-time reflection Lpy = LyLp = —I. 


1 For a three-dimensional rotation around a space axis m and by an angle ¢, we 
have the transformation law 


lee 
x’ = [cos ¢ + (1 — cos $)nen + sin gn X x]. 
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(d) Transformation into a moving frame of reference. Consider a co- 
ordinate system K, moving with a constant velocity W along the x’ 
direction with respect to another coordinate frame Ky. A physical point P 
will have the coordinates x and x’ with respect to Ky and K,. Then 
x' = Lyx and Ly will have the form 

or 0 0 

~_—— 7, Tee” 0 
eu tO 
0 0 01 


We have four matrix elements to determine. The defining property of the 
Lorentz transformations, Eq. 1.17 or 1.19, gives us three equations, 
gt, SS 1, Lo DPS 2 by. 
In addition we have the kinematical condition that the origin of K, 
(x°, Wt, 0, 0) is transformed into (xj, 0, 0, 0). This means that L'yx° + 
L ,(x°/c)W = 0 for all x°, or 
W 
L', = —L', —. 

0 ite 

Solving these four equations for the four unknowns, we obtain 


a) a 
| —py 7 UO al ar 
Ly = =Liy, (1.28) 
0 oS ft 60 
0 0. O00 


where y =1/(1 — B?)'/?, and B = W/c < 1. Ly belongs to the class Lt and 
goes over to the Galilean transformations Eq. 1.1 for W <c, as it should. 
We shall discuss below some applications of this important transformation. 
It played an important role in the development of the theory. In particular, 
it occurs in the works of Lorentz and Poincaré on the invariance of 
Maxwell’s equations and, of course, in the work of Einstein. 

The columns of Ly form a basis in Lorentz space. The eigenvalues of 
Ly are A=[(1 — f)/(1 + B)}'/7 and 4~'; hence the corresponding eigen- 
vectors are light-like. 


(e) Infinitesimal inhomogeneous Lorentz transformations. These are 
proper Lorentz transformations differing infinitesimally from the identity, 


x’ =x+oaxte, (1.29) 
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where @ is an infinitesimal matrix and e an infinitesimal four-vector. The 
condition Eq. 1.17 implies for the homogeneous part 


wotG+Go=0, (1.30) 
or 
OG= =e,,., where O,y = Jus’ - (1.30’) 
Thus the infinitesimal homogeneous transformations are determined 
by six parameters,’ the inhomogeneous ones by ten parameters. 
Next we shall discuss some physical applications of the transformation 


ioe 


(A) Relativistic Addition of Velocities. Consider a coordinate frame K, 
moving with respect to Ky with a velocity W, and a frame K, moving 
with respect to K, with a velocity W,. What is the relative velocity W 
of K, with respect to Ky? Let the corresponding Lorentz transformations 
be denoted by L,, L, and L, respectively. Then we have Lx = L,(L,x) for 
all x, or 


ile = ESE, , 


where each of the L, L,, L, is given by Eq. 1.28 with the parameters y, y,, 
and y, respectively. A matrix multiplication and comparison of the matrix 
elements of both sides gives, after some calculation, 


p= B, +B, 


crags (1.31) 


(B) Decomposition of proper homogeneous Lorentz transformations. We 
want to decompose any homogeneous transformation L into a product of 
rotations and a Ly. Because L connects frames that are, at most, moving 
uniformly with respect to each other, we can first bring the direction of 
the relative motion into, say, the x’ axis by a rotation, and then apply Ly. 
The two resulting frames can then differ only by another rotation. The 
effect of these three operations is the same as that of L. Hence every L 
can be written in the form 


L = R,LyR, > (1.32) 
where R, and R, are three-dimensional rotations. In this decomposition 


1In general the number of parameters for n-dimensional orthogonal or pseudo- 
orthogonal transformations is n(m — 1)/2. 
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Ly is unique and is given by Eq. 1.28, but R, and R, are not unique. 
Eq. 1.32 can also be written as’ 


L = Lake: (1.32’) 

As an example, we reconsider the problem of addition of velocities 
for the case where K, is moving in a direction that makes an angle « with 
the x' axis. Let R be the rotation by an angle «, so that the two x! axes 


coincide after this rotation. Then LRx = L,RL,, and, by comparing the 9 
components of both sides, we find 


(1 — Bi) — 63) 
(1 + BB, cosa)?’ 
which reduces to Eq. 1.31 for « = 0. Note that we cannot compare other 
matrix elements, other than §, because we have introduced only one 
rotation rather than two; the x? and x° coordinates may still be rotated 
with respect to each other. As another application of Eq. 1.32, we derive 
the form of Ly connecting two frames moving with respect to each other 
in an arbitrary direction with velocity w and B = w/c. We have 


Ly = R,LyR2 : 


BP =1- (1.33) 


where 


R and S are three-dimensional rotations, R carrying the x' axis into the 
direction of w; S is a rotation around w to make the x? and x* axes 
parallel. We get 


7 Fp — Bay —Bsy 
—Biy 14+AB,B ABB. ABB 
—B,y ABB, 1+AB2B, AB2Bs3 
—Bsy ABB, AB 3B. 1+ AB3B3 


where A = (y — 1)/B. Note that Ly = —L7y, as it should. It is sometimes 
convenient to put in Eq. 1.34 


Ly = , (1.34) 


y = cosh X, — By = nsinh X, 
then 
x = x° cosh ¥ + x-nsinh X, 


x’ = x°nsinh X + [x — n(x-n)] + nx-ncosh X. 
1 This follows, for example, from the polar decomposition of 2 x 2 unimodular 


matrices A which are homomorphic to restricted Lorentz transformations. See Section 
5B and Exercise 18. 
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(C) Doppler effect and abberation. So far we have considered the 
transformations of the coordinates x only. Physical equations involve 
other quantities beside the positions of the mass points in space-time. 
These quantities must also transform between inertial frames to guarantee 
the invariance of the physical laws. Consider an electromagnetic plane 
wave of the form 


@ _ Ade) 


where @ is an electric or magnetic field, or the electromagnetic potential, 
@ is the angular frequency, and k is the propagation vector. All pheno- 
mena, such as the vanishing of a light signal, or light reaching a maximum, 
etc., must be the same for all inertial frames. These processes are deter- 
mined by the phase « = (w/c)ct —k-r of the wave. Thus, « must be an 
invariant quantity under the Lorentz transformations. This suggests the 
introduction of a four-vector k : (w/c, k) so that @ is given by the invariant 
scalar product 


oo SHhem(ke Gx) (1.35) 


All that we have said about the Lorentz space of x vectors will apply 
also to the space of the four-vectors k. In particular, from its construction, 
we see that k is a light-like vector : k? = (w?/c?) — k? = 0. As a matter of 
fact, the name “light-like” originates here; k fs the propagation vector 
of the light. 

Let us look now at our plane wave from a different moving frame of 
reference. The propagation vector in this frame will be 


kt = ie (1.36) 


If the coordinate frame is moving, say, in the x! direction, we can use 
Eq. 1.28 to write Eq. 1.36 in component form. If « is the angle between k 
and the x’ direction and a’ the angle between k’ and the x! direction, we 
obtain 


__.1—Bcosa 
0 py 
B+ cos a’ 
1+ B cosa” 


i 


(1.36’) 


cos ¢ = 


The first equation gives the change in frequency, the second gives the 
change in direction of the wave as observed from a moving coordinate 
frame. 
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Introduction of the special theory of relativity in momentum space. 
Directly observable quantities of an electromagnetic plane wave are w 
and k, defined above, satisfying 


wjc = (ki + k3 + k3)!?, 
Starting from these quantities and noting that a plane wave is a plane 
wave in every frame of reference, we arrive at the analogues of Eq. 1.2; 
Viz., 
(w/c)? — k? = 0 implies (w'/c)? — k'? = 0, and vice versa. 

Again, for dimensional reasons, the equation (w/c)? — k? = a? must imply 
(w'/c)? — k’? = a. That this generalization not only determines uniquely 
the Lorentz transformations, but also has physical significance, is seen 
from the existence of plane waves (de Broglie waves) corresponding to 


particles with nonzero rest mass. Compare the discussion in Chapter II, 
Sec. I. 


(D) Transformation of finite lengths and time intervals. We discuss now 
the appearance of lengths and time intervals as seen from a moving frame. 
(Fitzgerald contraction and time dilatation). Consider two points P,, P, 
in space-time’ with coordinates x,, x2 in one frame, and x}, x} in another 
frame moving relative to the first. Then 


X2 — X, = Lw(x2 — %4), 
or, for a motion in the x' direction, using Eq. 1.28, 
c(t, — th) = pet, — t1) — By&2 — x4), 
x?’ — x" = —Byc(t, — ty) + (x? — x’). 


A length measurement in any frame is defined as the simultaneous identi- 
fication of the endpoints of the interval. Thus, setting t, — ¢; =0 and 
x? — x! =], we get 


[= - 1= (1 — p?)"/21, (1.37) 


i.e., in units of the first system, the length in the second system is shortened 
by a factor (1 — B7)'/?. Similarly, for the measurement of a time interval 
at a given point, x?’ = x"’, and we get 


At’ = (1 — B?)'/?Ar, (1.38) 


1 Points in four-dimensional space will also be called events. 
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i.e., time lags behind in the moving frame relative to the rest frame, or a 
process of a certain duration will take a much longer time in a moving 
frame as compared to a rest frame. For example, the decay time of an 
unstable particle when it is moving is longer than when it decays at rest.’ 
The observation of Eq. 1.37 may be more difficult for the following 
reason: the eye records photons arriving at the retina simultaneously from 
all parts of the object. However, these photons have not been emitted 
simultaneously by all points because of thé motion of the object. This 
effect exactly cancels the Fitzgerald contraction for objects that subtend a 
small solid angle, and moving circular objects will still have an approxi- 
mately circular appearance.” For this and the much-discussed twin or 
clock paradox, see Exercises 1.4 and 1.5. 


5. The Lorentz Group and Its Representations* 


(A) The Four Parts of the Lorentz Group 


The set of all Lorentz transformations form a group. This means that, 
given two transformations, their inverses and their product are again 
Lorentz transformations. The identity transformation is included in this 
set. The inhomogeneous Lorentz group ¥ is characterized by the set of 
translations and homogeneous Lorentz transformations‘ [(a, L)], and the 
multiplication law Eq. 1.22 or Eq. 1.22’ shows this group property. Y has 
the subgroup L, the full homogeneous Lorentz group. The elements of L 
are explicitly given by the 4x4 matrices L satisfying Eq. 1.17; ie., 
L*GL =G. Also, on the basis of this equation, one may easily verify that 
L~? exists and also satisfies Eq. 1.17, and, together with Z, and L,, the 
product L,L, is also a Lorentz transformation. In addition, L* and L*™’ 
also satisfy Eq. 1.17. 

By Eq. 1.18 the homogeneous transformations are divided into two 
parts: proper transformations with det L = +1 and the improper trans- 
formations with det L = —1. The proper Lorentz transformations L, form 

1 This has actually been observed: Compare the lifetime of, say, 7 mesons at rest 
(2.56 x 10-8 sec) and mw mesons moving at a velocity v ~ 0.75c [3.8 x 10-8 sec, 
R. P. Durbin, H. H. Loar, and W. W. Havens, Jr., Phys. Rev. 88, (1952), 179]. 

2 R. Penrose, Proc. Camb. Phil. Soc., 55, (1959), 137; J. Terrell, Phys. Rev., 16, (1959), 


1041; V. Weisskopf, Phys. Today, 13, (1960), 24; Lectures in Theoretical Physics (New 
York: Interscience Publishers, 1960), Vol. III. 

3 No detailed knowledge of group theory is required for this section. Although some 
of the details of this section may be passed over on first reading, the essential ideas will 
be important in the discussions of tensors, spinors, etc. 

“We shall denote groups and transformations by the same letter: the group ¥ 
has elements -Y, etc. 
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a subgroup, the improper ones do not. This is a consequence of the 
following propositions: det J = +1, det L~! = det L, and det L, = +1, 
det L, = +1 imply det(Z,L,) = +1. 

Furthermore, the orthochronous Lorentz transformations L* with 
L°>1 form a subgroup of L, but the antichronous transformations L ch 
do not. Again we have to verify that the inverse of an element of Lt 
belongs to L* and the product of two orthochronous Lorentz transfor- 
mations is again orthochronous. Let us prove the second statement first. 
Let L and M be transformations belonging to L*; then 


(LM)°, = Le M*, > Ly M°, ae LBs M*), 
and by the Schwartz inequality for the spatial parts L°,, M*o, 
(LM)%o 2 LM ~ [¥(L°s)"]"7[ 5, (Mto)?] "2 1 
k k 


the last inequality follows from Eq. 1.19. Moreover, we see that if L°, > 1 
and (LM)°, = 1, then M°, > 1; thus the inverse L~! belongs also to Lt 
together with L. This completes the proof of the group property of the 
orthochronous Lorentz group L*. 

The intersection of Lt and L, is the restricted Lorentz group L*.. 
Therefore we obtain the following four parts of the homogeneous Lorentz 
group: L1, Lt, Lt, L*. These four parts are disconnected; that is to say, 
two group elements in different parts cannot be connected continuously 
with each other by varying the parameters of the elements. The inhomo- 
geneous Lorentz group ¥ is a ten-parameter group; i.e., to specify a 
general transformation, we need to specify ten real parameters. The 
homogeneous group L has six parameters.’ We see these most easily from 
the infinitesimal transformations, Eqs. 1.29, 1.30. 


(B) Correspondence with the 2 x 2 Unimodular Group 


We discuss now a very close correspondence (a homomorphism) be- 
tween the restricted Lorentz group L‘ and the unimodular group of 2 x 2 
complex matrices.” This will allow us to introduce the basic entities called 
spinors. Both groups are six-parameter groups, and the correspondence 
we are going to discuss is a generalization of the well-known homo- 


1 The groups of transformations depending on a finite number of parameters are 
called Lie groups. Other Lie groups besides the Lorentz group or the group of rotations 
have found important applications in theoretical physics. In contrast to this, groups 
like the gauge groups of electrodynamics (see later) depend on arbitrary functions and 
may be called infinite groups. 

2 This group is called in literature SL(2,C) 
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morphism between the three-dimensional rotation group and the two- 
dimensional unimodular-unitary group.' 

More precisely, we want to show that if A is a 2 x 2 unimodular 
matrix, then to +A there corresponds a Lorentz transformation L(A), 


YAS (Ale Lt: (1.39) 


and this correspondence preserves the multiplication law (i.e., it is a 
homomorphism in mathematical language): 

+(AB) — L(A)L(B) = L(AB). (1.39’) 
The homomorphism is two-to-one; both +A and —A correspond to the 
same Lorentz transformation. We shall show this important correspon- 
dence explicitly in the following form: 


Let X be any 2 x 2 Hermitian matrix. Any 2 x 2 matrix can be written 
as a linear combination of identity and three basic 2 x 2 matrices.” Let 


oS | 
> 


1 


X =x +x!'o, + x76, + x70; = ut ae 
x1 4+ ix? x°-—x 


where the coefficients x°, x’, x”, x? are real numbers and the basic matrices 
o have been chosen to be the Hermitian Pauli matrices 


( ) an(° =i ey 2 me 
6,.= ’ 2 a ? Sa A ° 
> Nites i Oo ( |) aa 


Writing ¢ = J, we shall use the short notation 
X =0,x". 

Now let A be an arbitrary 2 x 2 complex unimodular matrix; i.e., 
det A =1. 


Then X’ = AXA?* is again a hermitian 2 x 2 matrix and must be of the 
form 

X'=x"o,, 
Now det X’ = det X, or explicitly 


2 2 2 ‘ ’ ‘ ’ 
xO — xP — x2? — x3? = x0? _ yt? _ 2? _ 3? 


Therefore, the matrix transformation X’ = AYA* induces a transfor- 
mation on the coefficient x”, which is a Lorentz transformation LAOS 
L(A)x. To find it explicitly, we note that 
L'\x’o, = Ax"a,A* = x"Ao,A*, 
1 See, for example, H. Goldstein, Classical mechanics (New York: Addison-Wesley, 


1959), p. 110. 
2 See, for example, H. Goldstein, Ibid. 
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or, comparing the coefficients of x", 
L’\0, = Ao,A™. 
To obtain L",, we multiply both sides with o,, take the trace,’ and we 


get finally 
[L(A)}", = 3tr(o,,40,A*) = 4tr(6"Ao,A*) (1.41) 


This is the desired correspondence. Clearly the matrices A and —A gen- 
erate the same L(A)*,. We have only to show that L(A) defined by Eq. 1.41 
belongs to the restricted Lorentz group L*.. We see this as follows: Take 
first the time-like vector (1000), then ¥ = Jand X’ = AA* contain positive 
definite diagonal elements, which means that x° > x* 20; ie., L is 
orthochronous. Both A = +1 correspond to L = J, and because infini- 
tesimal A’s correspond to infinitesimal L’s, det L must be equal to one. 
Moreover, there is no A corresponding to space-reflections. For example, 
the vector X: (0001) and a space-reflection lead to the equation Ao,A* = 
— 03, which is a contradiction. The fact that det L = +1 can also be 
proved by formal manipulation starting from Eq. 1.41. (See Exercise 6.) 


(C) Spinors 


We introduce now the concept of group representations. Let us consider 
a group G with elements (a, b, c, --- ). If to every group element a there 
corresponds a transformation (or matrix) D(a) in some linear vector space, 


11In this calculation and others we make use of the following properties of the 
matrices Eq. 1.40: 
troop =2, trozx=0, k =1,2,3, 


ojox =io1; j,k, i cyclic, or ayox = 85x + iejxior, 
oyt = o, = on}, 


oxo = 2io, 


oyopor = 07941 — Ond;2 + o1d;e + iejet, 
or 


i 
ead 5 curb aoho* = Buy; of: (I, —9); ou: (I, 9). 


The quantities ejx2, euv08 are totally antisymmetric indices in three and four dimensions. 
tr(ojox) = 285x, 
tr(ojoxor) = 2iejxr, 
tr(cjox01¢m) = 2(84%82m — 8525¢m + 54 mdx1). 
It is also convenient, for sake of covariance, to introduce the matrices 


Gn =o or ot = oy 


such that ic 
tr(c#o,) = 2gHy. 
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called the representation space, in such a way that to the product (ab) of 
the group elements there corresponds the product of the transformations, 
ab «+ D(a) D(b) = D(ab), then the set of transformation {D(a)} is called a 
representation of the group G. The representation is faithful if to distinct 
elements a and b there corresponds distinct matrices D(a) and D(). If, in 
addition, we require D(1)=J, then D(a~') = D~‘(a), and we have a 
representation in which all matrices are nonsingular.! 

With this concept we see that the 2 x 2 unimodular matrices A dis- 
cussed in the previous section form a two-dimensional representation of the 
restricted Lorentz group L1. We already know that A(1)= +1. The 
equation A(Z,L,) = A(L,)A(L,) is an immediate consequence of the two 
equations X’ = A(L,)XA*(L,) and X"=A(L,)X’At(L,). This two- 
dimensional representation is faithful, but two-valued. That is, to every 
group element there correspond two matrices +A. 

Let us now consider the vectors in the two-dimensional representation 


space and denote them by 
eo 
c= ( , (1.42) 


where £1, &* are the complex components of the two-vector €. Under the 
transformation A(L), € transforms as 


t= A(L)E, det A(L) = +1. (1.43) 


Two-component quantities that, under Lorentz transformation, trans- 
form according to Eq. 1.43 are called spinors. This is a generalization of 
the well-known spinors corresponding to the three-dimensional rotations 
R, where the matrices A(R) are unitary in addition to being unimodular. 

Up to this section we have considered vectors a as the geometrical 
objects that transform under Lorentz transformations according to the 
law a’ = La, such as the coordinates x or momenta k. Spinors are new 
geometrical objects with a quite different, but definite, transformation 
property. The concept of group representations allows us to give a general 
characterization of all such objects. If D(L) is any finite- or infinite- 
dimensional representation of the Lorentz group, then we will have geo- 
metrical objects { (or tensors in the terminology of E. Cartan) transforming 
under Lorentz transformations according to 


¢’ = D(LY. (1.44) 
¢ will have as many components or indices as the dimensionality or 


1In quantum theories it is necessary to consider more generally representations up- 
to a factor, ab <> oa, b) D(a) D(6), where w(a, 6) is a phase factor. 
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indices of D. In the particular case of vectors, L itself is, of course, a 4 x 4 
representation of the Lorentz group. Other realizations of Eq. 1.44 beside 
Eq. 1.43 will be discussed in the following sections. 

In the case of vectors, we know that the scalar product (x, Gy) is 
invariant under Lorentz transformations. We look now for an invariant 
of the form 

(¢, Cn) = &*C, gn? (1.45) 
in two-dimensional spinor space (the indices a, B, --* are summed over 
1, 2). Because € and 7 transform according to Eq. 1.43, the invariance of 
Eq. 1.45 implies that C must satisfy the equation! 


A'CA=C, det A= +1. (1.46) 
Any antisymmetric matrix C is a solution of Eq. 1.46. For if C is 
antisymmetric, then A’CA is: 
Ay,C'A,s — —A5,C”A,g — —A5,CA,, . 
in the last step we have interchanged the dummy indices 6 and y. We take 
a standard form of C, namely 


01 
c= i) isam Chm Ct = =r (1.47) 


and, in analogy to G in Eq. 1.6, we call it the metric spinor.” 
We defined covariant components of vectors as x, = 9,»x"; similarly 
we introduce covariant components of spinors: 


by = Capt, (1.48) 
so that the invariant bilinear form is 
(¢, En) = o", = — San’. (1.45’) 
If €* = A(L)*,é°, then the covariant spinors transform according to.” 
epAla DRM p=Aé,. (1.49) 


1 Note that although ¢ and 7 are complex vectors, the bilinear form Eq. 1.45 is 
formed like that of a real vector space. The form Eq. 1.45 itself is a complex quantity. 
There is no invariant of the form é,C%?ng. This would imply A+CA = C {instead of 
Eq. 1.46], which has no solution for C. 

2 For any 2 x 2 matrix M the identity 

CMTC = M-'det M 
can easily be verified, which for unimodular matrices reduces to Eq. 1.46. 

3 Equation 1.46 is the counterpart of Eq. 1.17. In the case of vectors, if x = LHX", 

then the covariant components x transform according to L7~?, or 


Xp’ = Ly’xr, 


Ly? = guol.%og®” (in matrix form LT~* = GLG). 
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The matrices A’ ‘ form also a representation of the réstricted Lorentz 
group; they are unimodular and the correspondence ! 4 A? * preserves 
the multiplication law (AB)<«>(AB)" "= A’ 'B™~*. The two representa- 
tions A and A™ * are equivalent? because 


A=C7!AT'C, (1.46’) 


Thus covariant and contravariant spinors belong to two equivalent repre- 
sentation of L, AT‘ and A, respectively, both leaving the bilinear form 
Eq. 1.45 invariant. 

There are two more two-dimensional representations that also leave 
the form Eq. 1.45 invariant, namely A* and A*~'; i.e., altogether there are 
four matrices satisfying Eq. 1.46: A, A7 ‘, A*, A*~*. In the case of 4 x 4 
Lorentz transformations, the last two did not occur because L is real. Now 
we have two new representations which, however, are equivalent among 
themselves, 

At "= CARGR, (1.50) 


which follows from Eq. 1.46, but these are not equivalent to the represen- 
tation A. That is, there is no similarity transformation connecting A and 
A*. (In mathematical language A > A* is just an automorphism, not an 
inner automorphism, as was the case for A —> A™', or A*+ At’) 

Therefore, we have new objects transforming according to the inequi- 
valent representation A*. It is convenient to denote these quantities by 
the same symbol but with dotted indices: 


GM = AM ch (1.51) 


Now the covariant components transform according to A*~*. The metric 
tensor is C* = C, and the invariant form 


(€, Ci) = Sng = — Exn’. (1.52) 


Finally, let us look at the Lorentz transformations corresponding to 
these four different 2 x 2 matrices A, A™', A*, At~'. Let L(A) be the 
transformation corresponding to + A, then, from Eq. 1.46, 


L(AT’) = L(C)L(A)L(C7?). 


1 The mapping A + AT~1, in mathematical language, is an inner automorphism. 
* Two representations D and D’ are called equivalent if there exists a fixed matrix B 
such that 
D’ = BDB"; 


in particular, we shall speak of unitary equivalence if B is unitary. 
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From Eq. 1.41 we find that L(+C) =), )3, where Y,; are the plane 
reflections perpendicular to the axes, Eq. 1.27. Similarly, 


ga”) =LLLA))Y. 


Because we do not have two inequivalent representations in the 4x 4 L 
representation, we expect that L(A*) and L(A*~’) will be related to L(A) 
and L(A™'). Indeed, from Eq. 1.41 and using O7 =IGyy026"S2, We get 


L(A*)*, = 4 tr (6,A*o,A") =} tr [Ao A*07)"] = 4 tr(oT AoTA*) 


=4 tr[é,(¢,402)6,(0,A02)* ] 3 
hence 
L(A*)", = L(A™ '),det(a,0,) 
and, similarly, 
L(A*”')*, = L(A)*,det(o,o,). 


A representation D of the (Lorentz) group is called reducible if it has a 
semidiagonal or box form, or if it can be brought into this form by an 
equivalence transformation BDB™'. In this case, the representation space 
is split into a number of lower-dimensional subspaces which are trans- 
formed into themselves under Lorentz transformation. Clearly it is suffi- 
cient to determine all irreducible representations of a group. 


D. Higher-Order Spinor and Tensor Representations 


From the two-dimensional representation A(L) of the restricted 
Lorentz group, we can build up higher-dimensional representations suc- 
cessively. For example, in four dimensions we have quantities €*%, £¢*, £4 
and the related quantities with covariant indices, which transform under 
Lorentz transformations as the product of two corresponding spinors, 
e.g.,! 

G8 = AS APREM, Ce AM AEM, Cb = AM ATEM. (1.53) 
This process can be continued to any number of covariant and contra- 
variant, dotted and undotted indices. A general spinor will transform as 
follows: 

Cee eet dee ALAS AA OPE .. 
(1.54) 

In this same manner we can generalize the vector transformations 

x’" = L",x” to arbitrary tensors; i.e., quantities 7””, --- that transform as 


1 £98 transform according to the Kroenecker product A x A, £4° according to 
A* x A, ete. 


30 Electrodynamics and Classical Theory of Fields and Particles 
the product of vectors. A general tensor transforms according to the law’ 
Too. = Lee (1.55) 

The invariant form eis O04 generalizes to 


oR Oa SRE Se 


; CO ae. nD) 

As both the two-index spinors and the vectors refer to the four- 
dimensional representation of the Lorentz group, we must be able to 
relate the two. The general problem here is to construct tensors from the 
spinors that are more fundamental because of lower dimensionality. 

The relation between vectors and two-index spinors is already apparent 
from the way we introduced the spinors in Section B. There we considered 
Hermitian 2 x 2 matrices of the form 


X =0,x", (1.57) 
which transform under Lorentz transformations into 
X' = AXA". (1.58) 


The matrix elements of X transform clearly like a two-index spinor, one 
dotted and one undotted; in fact Eq. 1.58 written in components reads 


Xia = AgPAZ?X pg, OF Aap XPPAX,. 


Therefore, Eq. 1.57 written in components connects vectors with second- 
rank mixed spinors: 
Xai = FyaaX” (1.59) 


Oya, are the matrix elements of the Pauli matrices (1.40) A Hermitian 
second-rank spinor is determined by four real numbers like the vector x". 
Remembering that the spinor indices are raised and lowered by C,,, etc., 
we find the following relations connecting the metric tensor Guy and the 
spinor tensor C,, and the mixed quantities Gong: 


Gi 04' + ot,08) = me Gad 
On, = Guavene. (1.60) 
Gacpp = CapCag : 


1 Spinors and tensors may be defined under more general coordinate transforma- 
tions. For example, under arbitrary coordinate transformations x’ = x(x), tensors 
transform as 

Ox’ Ax’? @x® Oxt 
Gh I aes Le on 
= Ox™ axn ax’t ax’e Sil = 

A general theory of tensor and spinor calculus is not required in this book, because 

we are dealing with Lorentz covariant theories only. 
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The general connection between the tensors and the spinors is then 


Xee = 6, 4,0 5aX 3,07) v > 
(1.61) 

Kiem OF ot XT .0",. 
Thus spinor operations can re translated into the tensor operations and 
vice versa. Some examples of this translation will be discussed later under 
physical applications. For example, the quantity €,0%°€,, or in matrix 
notation éo,€, will transform under proper Lorentz transformations as a 
vector. Instead of the dotted and undotted indices, we can work com- 
pletely in matrix notation. Let us denote the spinors € and 2 transforming, 
respectively, as 


¢' = A, Sane: 
Let €=C~!2, then 
Gi Calga=C 1A*é = C71AtG2 = AE, (1.62) 
where 4 = C71A*C. 
When we write equations in matrix form they stand for both upper 
and lower indices; e.g., €’ = Aé stands for €” = A*,t? and j= Aes. The 
elements A? and 4% are different but related. Similarly, ,., and o4,, 0%, 


Gi are different matrix elements. We choose o,,, to be the usual matrix 


elements. 
Another way of introducing the higher-dimensional tensors and spinors 
is by writing down the general representations of the 2 x 2 unimodular 


group. Consider the spinors 


1 _ [Si i 
Pe mes 
which transform with A and A* respectively and form the normalized 
monomials' 
jr—my efi tims pi2—m2¢j2+m2 
Edirmgda tm gla—mes 


Ree (1.63) 


2" Ea my)! Ci + my)! C2 — M2)! (jn + mg)! P?’ 
1 This procedure is similar to that of finding the representations of the 2 x 2 unitary 
and unimodular group, i.e. the spinor corresponding to the three-dimensional rotation 
group. Explicitly, if the spinor = (2) transforms under the 2 x 2 unitary and uni- 
modular group U, then the normalized monomials 
filme girm 


transform with 9(/)(U), the (2j+ 1) x (27+ 1) matrix representation of U or Rs, 
the three-dimensional rotation group. 


M;® = SHE STIS ah 
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with —j, <m, < +j, and —j, <m, < +/, in integral steps. For given j, 
and j, the M’s form components of a (2j, + 1)(2j, + 1)-dimensional vector. 
Under a Lorentz transformation these vectors transform according to 


M' = G2 4)M, (1.64) 


where 2432) is a (2j, + 1)(2j. + 1) x (2/, + Ij. + 1) matrix and, in 
principle, can be worked out from Eq. 1.63. By varying j, and j, from 0 
to 00, we get all the representations of the unimodular group A. We have 


also 
QG) — Gli). 


(E) Infinitesimal Generators of the Lorentz and 
Unimodular Groups 


We have seen that the inhomogeneous Lorentz transformations have 
ten parameters, the homogeneous ones six. These parameters occur in the 
matrix elements of L (six) and in the translation vector a (four), Eq. 1.20. 
There are other, sometimes more convenient, ways of exhibiting the para- 
meters of the transformations. For example, we can write the proper 
Lorentz transformations in the form of an exponential matrix,! 


L = exp(a""M,,), (1.65) 
where both a”” and M,, are antisymmetric in the indices p, o, or in the 
form 

6 

Whig exp Sy aK,). (1.66) 

i=1 


In both these expressions a” and a; are six real parameters and I,, and 
K; are fixed 4 x 4 matrices. 

More intuitively, we can use rotations around a space axis mn by an 
angle @ 


L = exp(iJ-né), (1.67) 
and pure Lorentz transformations by a hyperbolic angle x [see Eq. 1.34], 
L = exp(iN: ny), (1.67’) 


} The following properties of the exponential matrix may be readily verified: 

(a) If K is a complex m x n matrix, the series eX = 1 + K+ K2/2! + K3/3!4--. 
is always convergent; 

(b) Det(e*) = etr kK; 

(c) eXtL — eke if [K, 20: 

(d) eSKS-1 _ Sek Sa): 

(e) If Ar, +: +, An are eigenvalues of K, then e’1, ---, e*” are the eigenvalues of eX: 

(f) eX* = (eX)*, eX? — (eM)T, eX+ — (eX)+, e-K — (eX)-1. 
; (g) eX > K is a mapping of a neighborhood of unity into a neighborhood of zero 
in the space of the parameters of K. 
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again exhibiting the six parameters and the six matrices J and N. In all 
these expressions the coefficients of the parameters are called the infinite- 
simal generators of the Lorentz group. If we expand the exponentials for 
infinitesimally small values of the parameters, and use the defining Eq. 1.17 
of the Lorentz transformations, we get 


K;'G + GK; =0, (1.68) 


and similarly for the other form of the generators. The problem of the 
representations of the group can be reformulated in terms of the generators 
by finding all the matrices satisfying this last equation. 

The different group generators do not commute, as the corresponding 
Lorentz transformations do not. These commutation relations can be 
worked out from the properties of the rotations. For example, for Eqs. 
1.67 and 1.67’, one finds 

ella al | = ied", 

J, N*) => 18 jqN', (1.69) 

LN, N*] = — ie _J', 
or, introducing a dual antisymmetric tensor by J: (M?*, M31, M'?) and 
N = (Mm, M®?, M”>), MY’ = —M", 

[M*4, M*") = —i(g**M*" a gM*# a gi*M”” = gM"), (1.69’) 

In this form we get back Eq. 1.65. M*” then correspond to rotations in 
the plane (uv), with the matrix elements 


(M")’, = —iLg""g, — 9°95]. (1.70) 


For the inhomogeneous Lorentz group we have, in addition to L, the 
translation 7, which can be written in the form 


T = exp(ia,P"), ._——° «(1.70 
where the generators P* commute among themselves, but not with M*’, . 
[M*", P°] = —i(g"’P” — g’’P"). (1.72) 


From this one can evaluate [J, P“] and NP"). 
Finally, we exhibit the generators of the 2 x 2 unimodular group. This 
is again a six-parameter group, and we can write 
Toe 1 
A= exp[i(M""s,0,)] = exp (5 jot 5 ie). (1.73) 


From Eg. (1.41), we see that exp(4J-o) generates pure rotations and 
exp(42.-a) generates pure Lorentz transformations. 
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To summarize, an r-parameter group of n x n matrices will have r 
linearly independent generators, which form an r-dimensional vector space. 
The group property of the matrices implies that the commutator of any 
two generators [K,, K,] also belongs to this vector space;! i.e., it is a 
linear combination of the other generators 


[K;, K,] = lees (1.74) 
f 


The coefficients k,;, characterize the group and are called the structure 
constants. Equations 1.69 and 1.69’ are examples of this relation. 


(F) Complex Lorentz Group 


At the end of Section 2 we have given two generalizations of Lorentz 
space in complex domain. Correspondingly, two types of complex Lorentz 
groups may be defined by the equations 


i Gia. L'GL = G, (1.75) 


respectively. The first one leaves a real norm (Zz) invariant, the second 
one a complex norm zz. We consider the second possibility. 

- In this section we shall write x, y for complex four-vectors. We con- 
sider again the 2 x 2 matrix 


X =0,x", 


which is no longer hermitian, as the x"’s are complex, and the trans- 
formation 
X’=AXB=<c,x'", det A=det B= 1. (1.76) 


Hence 
det XY’ = det ¥ = xX, = x" xy, 


Thus the transformation, Eq. 1.76, generates a complex Lorentz transfor- 
mation with x°* — x!? — x?? _ x3? — invariant, where x“ are complex 
numbers. In the same manner as in Section (B), we find the Lorentz 
transformation L(A, B) corresponding to two unimodular matrices A 
and B, 


L(A, i eae tr("Ao,B). (1.77) 
Equation 1.41 is a special case of this relation.” 
1 See exercise 7. 
? For applications of the complex Lorentz group in quantum field theory, see R. 
Jost, Helv. Phys. Acta, 33, (1960), 773; R. Jost, Physics in 20th Century (New York; 


Interscience Publishers, 1960). A. S. Wightman, Dispersion Relations and Elementary 
Particles, C. de Witt, ed. (New York; John Wiley and Sons, Inc., 1960). 
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Again the complex Lorentz group splits into two parts, one with 
det = +1, the other with det = —1. But now, in contrast to the real 
Lorentz group, Lt and L{ are continuously connected. For example, the 
complex Lorentz transformation 


cos@ ising 0 0 

ising cos¢d 0 0 ~ VSpex, 
0 0 cos@ sing] - 
0 0 —sing cos@ 


connects continuously the identity transformation with Lp7, space and 
time reflection. 


6. The Principle of Relativity: Invariance and Covariance 


The inertial observers will compare their coordinates (the space-time 
events) with the help of Lorentz transformations. The observers must also 
compare the physical laws. The physical laws must be independent of the 
coordinate frames, i.e., the same for all inertial frames. In other words, 
it is impossible to infer a particular inertial frame from experiment, for 
example, whether or not a coordinate frame is at rest or in uniform motion. 
This is the principle of relativity. We shall use it in the form that all physical 
laws must be invariant under the inhomogeneous Lorentz transformations. 

As a generalization of ordinary vector calculus (which expresses the 
fact that the vector equations are valid in any three-dimensional coordin- 
ate frame), we can write the fundamental equations of physical theories 
as four-vector, tensor or spinor equations such that the form of these 
equations is the same for all observers. In this form all-terms of the 
equations transform in the same manner. We will then say that the 
equations are covariant. 

However, it is important to realize that the relativistic invariance does 
not necessarily imply covariance. The equations need not be manifestly 
covariant; that is, the terms of the equations are not simply vectors, 
tensors or spinors; nevertheless, there is a definite law of transformation 
of these quantities. For example, one may use more general representa- 
tions of the Lorentz group, then the tensor or spinor representations, those 
involving an additional transformation, if the theory in question will 
allow other invariance transformations besides the Lorentz invariance. (See 
Chapter V, Section 3.) 
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Or, an equation may not be manifestly covariant because it has been 
reduced to a particular frame. If a covariant equation reduces to a familiar 
equation in a particular frame, then the former is a unique covariant form 
of the latter, for if an equation is valid in one frame, it must be valid in 
all frames. 

Examples of covariant and noncovariant, but invariant, equations will 
be given later in this book. 


7. Tensor and Spinor Fields and Momentum-Space 
Functions 


Particle properties are described by constant tensors or spinors in 
space, whereas fields are defined to be “objects” that are continuous 
functions of space-time or momenta of the particles. They can be classified 
according to their transformation properties: 


(a) Scalar fields, ¢(x). A scalar field describing some physical quantity 
has the same numerical value at each physical point P for all observers. 
If two observers denote the point P by x and x’ and the quantity by 
@ and ¢’, we have 

$'(x') = G(x). (1.78) 
In particular, there are invariant scalar fields; e.g., f(x”), f(kx), whose 
functional form is also invariant under Lorentz transformations. 


(6) Vector field ¢,(x). The physical vector quantity at some physical 
point P, again viewed by two different observers, will be denoted $),(x’) 
and $,(x), respectively. Although the physical vector is fixed, a component 
of this vector in one frame is a linear combination of the components with 
respect to the:rotated frame under L. Hence 


P(X) =LYO(x), _ $4(x') = L,",(0), (1.79) 
for the covariant and contravariant components respectively. 


Derivatives of a scalar field, 0p/dx* = ¢,,, transform like a vector 
field, for 


ror — 99'(%') _ 8G(x) Gx” 
Q! (x ) = eri = ox! ax'# = Ee, @ (x) ; 


similarly, $’"(x') = L",,"(x). 
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(c) Tensor field. The physical quantity at each space-time point has an 
array of components. It is a tensor field if the transformation law is the 
generalization of Eq. 1.79: 


ee 1 POL LT) (1.80) 


Tensors of the same rank may be added or subtracted. If a covariant and 
a contravariant index of a tensor are equal, they may be contacted (i.e., 
these indices may be deleted; the resultant tensor is of rank nm — 2. This is 
because the form $"@, is an invariant scalar. The derivative of a vector 
field is a tensor field of rank two: 


G(x’) = L*gL,?G%(x), 


which can be verified in the same manner as for 9,,. 
In the same way we can verify that forms like 


ea apeeeOr, ee Sugg 


7 
are invariant, for 
ox’* ibe Tobe Ox” 
Me OF INGE "ee a ns 
a GDS; . <a) ox ox: ox? Gx’? 


Se oer ones. (Xx) = rr) Sy. gO 


PET (XS TeX) 


In the case of Lorentz transformations, notice that we have set 


one ox” —1)\v . v vo = Vile 
ox” = Ls: éx"™ =(L- se = (GIG) uw—9 pon = L, ? 


thus, in our notation, 
— ID 
bf =07. 


(d) In terms of group representations we can express: the general 
geometrical objects in the following form. Let D(L) be an arbitrary repre- 
sentation of the Lorentz group; i.e., for two Lorentz transformations L, 
and L,, 

D(L2L,) = D(L2)D(L)). 
Let &* be the vectors in the space in which D(L) acts. Then we have fields 
transforming according to 


E'%(x!') = 6'(Lx) = D(L)%52°(x). 


Tensors are special examples of this transformation. Another class of 
quantity is the spinor field. 
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Spinor field. In matrix form the spinor field transforms as 
o'(x') = S(L)e(x) (1.81) 


where S(L) is a general spin representation of the Lorentz group. 

The previous transformation properties also apply to functions of 
other variables, for example, functions of the momenta of the particles. 
We have again tensors 

Boca 05 | 
and spinors 


on(k), 


transforming according to Eqs. 1.80 and 1.81 respectively, which are 
functions of one or more four-vectors k. 


8. Analysis 


In this section we shall discuss some of the differentiation and inte- 
gration properties of fields and surfaces in four-dimensional space. 


We introduce the four-dimensional gradient and divergence operations 
by 


0 
0 =0,6= $=, 
0p" = OO, = —- ‘a3 


similar expressions hold for other covariant or contravariant indices. 
Note that 6, = 0/dx" is a covariant and 6” = 6/dx, is a contravariant 
vector. 

The gradient vector of a scalar field 0,¢ is “orthogonal” to the three- 
dimensional surface in the sense of our metric, for 


3, dx" = do =0; 


if dx" lies on the surface, ¢ is a constant. 
The covariant generalization of the “curl” of a vector field is the anti- 


symmetric tensor 


G,)(x) = 8,0,(x) ion OO, (x) a —G,,(x). 


In three dimensions this reduces to the usual definition of the curl, 


G,; =0:6; — 0;9;, with G,, = (V x ¢),, plus two other equations obtained 
by cyclical permutations. 
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From the vector 0, we can form formally and operationally a scalar, 
its “norm,” 
6? 


ax°’ 


WO,=07=0?= Ae 


(0? is an invariant differential operator. 
The curl of a gradient is always zero, for 


0,8 aa OP =0 3 
however, the divergence of the curl is not zero. It will be written as 
GY, or GY = é¢", — 27¢". 

The integral theorems of vector analysis can be generalized to four 
dimensions also. 

The vanishing of the closed line integral § ¢,dx" = 0 implies ¢, = ¢.,, 
for § p, dx" = § dp = 0. 

The four-dimensional volume element will be written as 

dx = dx® dx! dx? dx}. 
The boundary B of a four-dimensional region R is a three-dimensional 
surface which can be described either by three parameters u,, uz, u3, OF 
by its equations, F(x°, x', x”, x°) =0. The surface element on B can be 
represented by a four-vector do” defined by 
ie Gs ak dx, dx” dx? dx*, dx° dx’ dx°, dx° dx* dx’). 
If B is the ordinary three-dimensional space for dx° = 0, or x° = const, 
then do, is in the direction of n,:(1, 0, 0, 0). If we introduce the three 
parameters u,, vu, u; on B, Gauss’ theorem reads 
ee ¢ @¢ ¢° 
ox° wax’ 6x? dx° 
du’ du! @u' du! — 
dg" Oe ax! ox?‘ ex? 
a —— eee tidy? da” = | do,d", (1:82 
" si {, Qu? Gu* Gu? du? eee . gee ane 
ox Wx! OF Ox? 
éu> du? du> due 


where we used the relations 
GG x"x") 
~ @(utu?u3) 


Thus do, are independent of the parametrization of the surface B. 


doy du! du? du? = dx’ dx? dx’, ete. 
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Of particular importance will be the notion of a space-like surface. 
B is space-like if do“ is everywhere a time-like vector. A space-like surface 
can be defined also as the set of all vectors x, y, -:- , whose “distance” 
(x — y)? is space-like. As we know, the normal vector to a three-dimen- 
sional space-like manifold is time-like. The only difference between three- 
dimensional space and a space-like surface is that in the latter case the 
normal four-vector to the surface may be an arbitrary time-direction 
which is equivalent to the x° direction. Points of space-like surface cannot 


a > nn 
4 


oS 


oi 


(a) (b) 

Fig. 2. (a) Space-like surfaces passing through the origin and their 
“normals”; (b) Space-like surfaces not passing through the origin 
be reached from each other by any signal. A space-like surface passing 
through the origin will thus consist of vectors y which are orthogonal 
to a time-like vector n:ny = 0. An arbitrary vector x has a “component” 
on this surface given by y = x — n(nx). [See Eqs. 1.9 and 1.9’.] We will 
also introduce space-like surfaces not passing through the origin. These 
will be characterized by vectors y satisfying ny = 6, where @ is a real 
parameter. For, if two vectors y, and y, satisfy ny, =ny, = 9, then 
ny; — ¥2) = 0, or all the differences (y, — y2) are orthogonal to n. The 
spatial component of an arbitrary vector x is now 


y=x—n(nx — 8). 


6 will play the role of a coordinate invariant time variable. (Fig. 2.) 
From Gauss’ theorem it follows that for a divergence-free vector, i.e., 
o%, = 0, we have the relation 


| do,o* = (01, 
B 
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If we now take B to consist of any two space-like surfaces o, and oa), 
and if we assume that ¢“ approaches zero sufficiently rapidly, then we 
can neglect the contribution to B of the infinite cylinder connecting o, 
and o,, and we get 


0 -( do,o" = { do," -{ do,¢", 
B o2 m1 


One fg do," is independent of the space-like surface o, not only at a given 
time, but at all times. In other words, under the stated conditions, f do," 
is a constant, for which we can also write, choosing, say, the surface t = 0, 


f do,o" = J 6° d*x = const. (1.83) 


The converse of this theorem is also true: if J, do," is independent of o, 
then $1, = 0. { do," is independent of o, then $4, = 0. do," is then a 
scalar number. 


9. Further Developments and Exercises 


1. Consider all linear, nonsingular transformations that leave 
x°’ —r? =0 invariant and that map origin into origin. If we do not 
invoke the dimensional argument, what other restrictions are necessary 
to derive Eq. 1.3 in text? (See V. Fock, The theory of space-time and 
gravitation, Pergamon Press, 1959.) 

2. Discuss the eigenvalues of complex Lorentz transformation. [See 
E. P. Wigner, Annals of Mathematics, 40, (1939), 149.] 

3. Verify Eqs. (1.31) and (1.33). 

4. Appearance of moving objects. The eye or the camera looking at a 
moving object records photons arriving at the same time. Due to the motion 
of the object, these photons cannot have been emitted at the same time. 
Show that for an object subtending a small solid angle this distortion of 
the object cancels, to a first approximation, the distortion due to Fitz- 
gerald-Lorentz contraction so that, say, a moving circular object will 
appear circular again. 


5. The clock “paradox’’. Innumerable papers have been published on 
this famous problem.’ To avoid the complication due to the turning of 


1 For some recent expositions, see A. Schild, Amer. Mathem. Monthly, 66, (1959), 1; 
G. A. Blass, Amer. Mathem. Monthly, 67, (1960), 754. 
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velocity, we modify the problem as follows: Consider three points on a 
straight line A, B, C, the distance AB being equal to the distance BC. 
A light signal is sent from B. At the arrival of the signal at A and C, two 
travelers move from A and C towards B with the same speed. If, in the 
fixed coordinate frame at A, the time needed to travel from A to B is ¢, 
the time needed in the moving frame is t’ = t(1 — B”)'/*. However, if we 
consider the moving frame as the primary one, the point A moves back- 
ward with the velocity —v. The distance AB will appear Lorentz-con- 
tracted to a traveler in a space ship, x’ = x(1 — B?)!/?. Again measured 
from the space ship, x’ = vt’, or t’ = (x/v)(1 — B?)'/. Therefore, with 
respect to both coordinate frames, the time t’ is shorter, and there is no 
paradox. When the traveler reaches B, we concentrate on the traveler 
coming from C; the “return” trip is symmetric. If we consider a round- 
trip instead of the point C, the argument is the same. The effects of 
acceleration at the turning points can be neglected, the problem is purely 
a special relativistic one. 

6. Prove that the determinant of L(A) and L(A, B) defined by Eqs. 1.41 
and 1.77 is equal to +1. Hint: Start from the equation [see before Eq. 1.41], 


LF ai = A,'6,4AZ*), 


which can be written as (A x A*)T = TL, where 


Go11 %1,11 F2,11 F3,11 1 0 O 1 
T= 1 [9,12 Cpe Cope Ospia| alg 0 1 -i 
V2 Oo,21 O1,21 G2,21 93,21 v2 10 1 i 
Go,22 93,22 F2,22 63,22 1 0 0 -!1 
Hence 
Ax At prt 
and 
[tr Al? = tr L, 
det (A x A*) = (det A)* = det L. 
[Note that det (A x B) = (det A)*'™? (det Byam a 
7. Obtain the inversion of Eq. 1.41, 
Lora. 1 : = 
£4 Coglgottalae ~ (tL + (Lin + Ley — ie LAa,), 


where 
N? =4 + tr(LL) — (tr L)? + ie *L, Le, 
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8. We have introduced the matrices, 6,:(1, —o). Prove the following 
relations: 
og’ = gg + yiet'"o 6, : 


G0” = g"” — tie”G,0, ; 
[o", 6] = hie’ [o,6,-6,0,];  {o", @} =2gh” +4 er [o,6, 
ou +o oO = 29" + 6,0, ] 
tr[o"é"] = 29”; 
trlo"6"o76"] = 2[ gg’? — gg”? + gt?g’* — ict”) ; 


OzaF upp = 20 apap - 


9. Covering group. Show that the group SL(2, c) is the universal cover- 
ing group of the proper Lorentz group Lt. (See Wightman, Joc. cit.) 

10. Prove Eq. 1.74. 

11. Show that the three-dimensional complex orthogonal group 0; is 
isomorphic to the proper Lorentz group L*. and that the corresponding 
formulas (as in Exercises 6 and 7) are 


R;; = 4 tr[o,Ao;A~*] 5 


+A=—(1i + 6;0;R;;); N? = 4(1 + tr R). 


1 
N 

12. Obtain the connection between the parameters of the Lorentz 
group in x,=L,’x,+a, and that associated with the infinitesimal 


generators 


og ee 


13. Discuss Lorentz transformations when curvilinear coordinates are 


used. 
14. Consider the four-vector 


Wt = te", J,, ; 


Show that W?, like p? = p,p”, commute with all the infinitesimal operators. 
Show that 

[p,, W,] = 0, Prager | = — ie" pW. 
' Fora given momentum p, p? =m’, m # 0, choose a basis, as in Eq. 1.13, 


with vectors 
£0 = p/m, Qi, @2;5 €35 
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satisfying Eq. 1.14. Let 
3 


w= 3 Sie?, 
i=1 


because W*p, = 0; then ; 
y= Se 


and the three operators S' have the commutation relations of the angular 
momentum 
[s, S4] = ime*S*. 
This provides a covariant description of spin or polarization for particles 
of nonzero mass. The S' are the infinitesimal generators of the three- 
dimensional orthogonal group which has finite-dimensional represent- 
ations such that 
W? = —S? = —j(j + 1)m, 

and W-y has eigenvalues m, —j <m < +/j; j is an integral or half-odd 
integral number. 

15. Show that unimodular matrix A corresponding to a Lorentz trans- 
formation, which takes a vector k into the direction (m, 0, 0, 0) (i.e., rest 
frame), is given by 
: oe (=) ie (“2)" 

m m 
the Hermitian square root of (k"o")/m. 
Show that for a four-vector, with k? = m?, 


(ko,,/m)'!? = cosh(z/2) + k-o sinh(y/2), 


where k is the unit three-vector in the spatial direction and cosh y = k°/m. 

16. Solve the same problem for light-like and space-like vectors when 
they are transformed into the vectors (m,0,0,m) and (0, 0, 0, m), 
respectively. 

‘17. Determine the group of Lorentz transformations that transform a 
given four-vector into itself. (The little group of the four-vector.) Show 
that for a time-like vector the A’s of the little group are unitary and that 
the little group is isomorphic to the three-dimension rotation group. 

Show that the little group of a light-like vector is essentially iso- 
morphic to the group of rotations in a Euclidian plane. 

18. Show that any non-singular matrix A can be decomposed in one 
and only one way in the form A = HU where H is positive definite her- 
mitian and U is unitary. Apply this theorem to 2 x 2 unimodular matrices 
and show that U generates a rotation and H a pure Lorentz transforma- 
tion. Give their explicit forms and prove the decomposition, Eq. 1.32’. 
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II. 


3 
DYNAMICS 


1. Proper Time Form of the Equations of Motion 


We start with the notion of the mass point and shall find its laws of 
motion in accordance with the principle of relativity and with Lorentz 
transformations. As we have discussed at the beginning of Chapter I, we 
expect to find a generalization of the equations of the classical mechanics 
that will reduce to the classical case in the limit of small velocities. 

We cannot consider the time f as a parameter in the same way as in 
Newtonian mechanics, for Lorentz transformations involve also a change 
of t. However, this does not mean that we cannot differentiate with respect 
to t. In each frame one can still define velocity and acceleration and so on, 
as we shall see. A formally simple way, which parallels exactly the deve- 
lopment of classical mechanics, is the introduction of an invariant para- 
meter s which will play the role of t. We discuss this formalism first. 

The trajectory of the mass point in space-time will be deseribed by the 
“position” vector x“(s) as a function of the invariant parameter s to be 
specified presently. The trajectory in the four-dimensional Lorentz space 
will also be referred to as the world line of the mass point. As the invariant 
parameter, we first choose s defined by 


ds? = dx" dx, = dx™ — dx” — dx”’ — dx®; (2.1) 


later we shall also use arbitrary parameters t. ds? is the simplest invariant 
one can form, and it has a definite physical meaning: In a coordinate 
system in which the particle is at rest dx = 0, and ds? reduces to 
ds? = dx” = c*dt*. 
47 
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Thus, in the rest frame of the particle s is essentially the time, apart from 
the constant factor c?. We shall denote s the proper time of the particle. 
The world line of the mass point in the rest frame is just the x° axis. In 
any other frame the value of s, not the meaning, will be the same as in 
the rest frame. 

Now we can define velocities and accelerations with respect to the 
invariant parameter. The four-velocity is defined by 


dx 
n= aS) , or, in components, u* = a) = 7%). (2.2) 
ds ds 
The four-velocity u is always a unit time-like vector, for 
dx dx dx? 
=——=— = 1. 2.3 
ds ds_— ds? Cy) 
The components of u can also be written as 
5 ee axed 
=>) wS— Sey, 
ds ds c 


where v = dx/dt is the ordinary velocity. The three space components of u 
are the velocities in the frame of reference considered, the time component 
is something new. This can be obtained from Eq. 2.1: 


axe 1 1 dsc 
A ill, Se SRE Sy Se _ TRE —_—S-, 4 
a ro a (lel ca ees 
Similarly, we define an acceleration by 
. du 
ee (2.5) 


The four-vector x is always orthogonal to the velocity x = u. For, dif- 
ferentiating Eq. 2.3, u? = 1, with respect to s we get 
uu = 0. (2.6) 
Thus we have in the 4-dimensional Lorentz space an analogue of the 
circular motion where velocity and acceleration are always perpendicular 
to each other. Both are consequences of the condition x? = const. 
Finally, we introduce the relativistic momentum. Here we need a new 
parameter characterizing the mass point, the rest mass mp. It is an intrinsic 
and invariant constant influencing the motion of the mass point. Its 
importance lies in the fact that different particles can be approximated 
by a mass point, as far as their motion is concerned, with different values 
of my. Otherwise, for the concept of abstract mass point, My can be put 
equal to one. From experience it has been found sufficient to specify 
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different particles by a single parameter within some limits. We shall 

see later that other intrinsic parameters, such as electric and magnetic 

moments and spin, must be introduced also. The explanation of the nature 

of these parameters, such as the rest mass mg, is outside the scope of 

mechanics, which concerns itself with the laws of motion. Their expla- 

nation must be found in the theory of the structure of matter or of particles. 
The four-momentum is defined by 


’ p= Mocu, (2.7) 
or, in components, 
The momentum p is always a time-like vector with norm 
p” = moc? = p™ — p’, (2.9) 


which follows from Eq. 2.3. Putting, formally first, m =m /(1 — B?)'/?, 
the components of p take the form p° = mc and p=m-y; i.e., the usual 
classical space components of the momentum. The physical significance 
of this relation will be discussed below. Note that in the limit v/c + 0 we 
get p = mov. 

Having defined the necessary concepts of the motion, we proceed now 
to the formulation of the law of motion. We see from Eq. 2.9 that, given 
Mo (positive), the motion must be such that the vector p will always lie in 


Hyperboloid (mass shel\) 


Fig. 3. The most general motion of a particle is described by a 
function p(s) on the forward mass shell hyperboloid. p is the 
momentum, s the proper time 
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the forward hyperboloid shown schematically in Fig. 3. Because Lorentz 
transformations also keep the vector p(s) on the hyperboloid, special types 
of (linear) ‘“‘motions” will be obtained by the Lorentz transformations 
L, which take the vector p(s) into some other vector p(s’). But we want 
to discuss the motion in a physical sense such that in the limit we get 


Newton’s equations.* 
The simplest invariant generalization of Newton’s equations is 


=k (2.10) 


where K is the four-force. We see from Eq. 2.6 that K is always orthogonal 
to the velocity, 

Ku = 0. (2.11) 
We shall refer to Eq. 2.10 as the Minkowski equation of motion. 

We have not yet specified the form of K which characterizes the 
rate of change of the momentum. However, let us first see the physical 
significance of Eqs. 2.10 in the more familiar form by showing that 
these equations contain, in the limit, the three Newton’s equations plus 
the energy equation. 

Using Eq. 2.7, we write Eq. 2.10, viz., 


in the following form: 
d dt dx" ds 
i "°° ar Tice dik 
Comparing the three space components (u = 1, 2, 3) of this equation with 
the classical equation (d/dt)(m dx/dt) = F, we get a formal equality if we 
put 
dtm set 


ds (1— py? 


There is more than a formal analogy. In the limit v/c + 0, both equations 
become identical, and we see how to obtain the relativistic form of the 
Newtonian forces. Eq. 2.12 has also been interpreted by saying that the 
“mass” increases with velocity. But there is no internal mechanism in- 
creasing the mass of the particle; the really physical mass my does not 
change. What Eq. 2.12 says is that we could correct the classical equation 
by making the mass variable according to Eq. 2.12. But Eq. 2.10 is the 
more fundamental formulation of the relativistic generalization. 


Moc — m, and K aa =F. (2.12) 


1 Even classical mechanics can be put into a form of transformation theory. 
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Let us return to the time component of Eq. 2.10: 


En <a) -4 oe Ke 

ds\ ° ds} ds(1— py’? * 

From Eq. 2.11, we find K°: K° dx°/ds = K-u. Or, using Eq. 2.12, 
d 
= (mc?) = Fry. 


We can integrate this equation, obtaining = 
me? = J F-v dt = { Fedr. 
If we take conservative forces; i.e., F = —Vd¢, we find! 
mc? + @ = const = E. (2.13) 


We shall interpret the constant E as the total relativistic energy, thereby 
obtaining also the conservation law for the total energy. This is based on 
the limiting form of Eq. 2.13 as v/e +0. Expanding Eq. 2.13 in powers 


of v/c, we get 
4 


v 
E = moc? + $mgv? + 3m pe ae @. (2.13') 


The total energy must be defined with the additive constant mc’, called 
the rest energy in order to satisfy the conservation law, Eq. 2.13. The 
nonrelativistic definition of the energy is obtained as follows: 
Eyr = lim (E — moc’). (2.14) 
v/c>0 
The quantity T = (E — moc” — ¢) can be defined as the relativistic kinetic 
energy.” 


1Or, we can integrate directly the equation (d/dx)(mc?) = cK® = (ds/dt)K-u with 
respect to s; 


mec = 2 K-uds = [Gwar = f -Vee-ae = —¢ +F, - 


! dt 
by Eq. 2.12. : , 
2 The expression for T can be obtained also from the following argument: F:dr 
is the work done by the external forces on the particle and is equal to the increase of 


its kinetic energy; , 
‘de = — dr = dT, 
F-dr os (my) dr 
or, 
mov ° dy 
<r 
where we have obtained dm from Eq. 2.12. Integrating this equation with respect to v 
between 0 and v, we get 


dT =v? dm + mv dy = 


1 
i= mec!| «— ae = 1}, 
which is just (E — moc? — ¢). 


° 
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Now we shall express E as a function of the momentum, rather than 
the velocity as in Eq. 2.13 or Eq. 2.13’. From Eq. 2.8 and Eq. 2.9 we get 


1 
E= p+; m2? = (E- 9)? —p*, (2.15) 


for the type of conservative potentials we are considering. 

In particular, a free particle [¢ =0, or K =0 in Eq. 2.10] is charac- 
terized by a constant time-like vector, momentum vector p, with com- 
ponents 


E FE? 
pee (=.»); p= moc? => =P’, 
Or, 
E? = c*p? + mic*. (2.16) 


Using the definition, Eq. 2.14, of the nonrelativistic energy and 
expanding E with respect to p?/m2c? in Eq. 2.15, 
4 
Eyr=5-——-—zoaat +d (2.17) 


which is the classical energy relationship with the first-order correction 
term in kinetic energy. 


Particles with Zero Rest Mass 


The concept of a zero-mass particle does not exist in Newtonian 
mechanics. As a matter of fact, our definition of the four-momentum, 
Eq. 2.7, breaks down if mo = 0. However, if we take E and p as primary 
concepts rather than the velocity or the trajectory, Eq. 2.16 still makes 
sense when m, = 0. In this case p cannot be connected to the velocities; 
any localization of the particle is not defined, or not possible. We have 
then a nonzero, but light-like momentum vector P:i(E/c, p), with 


E = pe, (2.18) 
where p,= (p’)'/?. Furthermore, we see from Eq. 2.17 that there is no 
nonrelativistic limit to this concept. A particle with m, = 0 must be neces- 
sarily relativistic. We also note that at higher and higher momenta the 
particle behaves more and more like a particle with rest mass zero. 

At this point we shall discuss an important realization of a particle 
with mp =0, the photon.’ We have seen in Chapter I, Section 4C that a 


1 The concept of photon goes back to Einstein (Ann. Phys. 17 (1905), 132; Physik- 
alische Zeits. 10 (1909), 185 and 817). 
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plane electromagnetic wave is characterized by a light-like four-vector k 
(plus the polarization property), k =(w/c,k), where w is the angular 
frequency of the wave, and k is its propagation vector such that 


@ = kc, 


where k = (k”)'/?. In various experiments, such as those concerned with 
blackbody radiation, the photo effect, the maximum frequency of x-ray 
spectra, etc., one can associate to a light plane wave an energy 
proportional to w, and a momentum proportional to k. The proportion- 
ality constants in both cases must have the same dimensions and their 
numerical value turns out to be equal, 


=n, p= hk. 


This is essential, for then we see that E and p defined for the plane wave 
exactly satisfy the energy and momentum relationship for a relativistic 
particle of my = 0. In dealing with questions regarding energy and mo- 
mentum interchange of light waves, say with other particles, the plane 
wave can be considered as a particle with m) =0. The appearance of h 
(whose value is taken from experiment) indicates the so-called quantum- 
mechanical effect, and the above equations, which connect particle con- 
cepts like E and p with the wave concepts w, k, constitute a fundamental 
principle of quantum mechanics.’ Another mass-zero particle is the 
neutrino, which is associated with the plane wave of a spinor field. 

Any generalization of a special physical theory must be verified experi- 
mentally. With respect to our Minkowski equation, p = k, we can state 
that both of its conclusions, namely the three-dimensional equation of 
motion or the variable mass, and the energy expression and its conservation, 
have been verified experimentally, the latter in nuclear-reaction experiments, 
where the rest masses of the particles undergo a change, and only the 
total energy defined with the rest energy term is conserved. Therefore, we 
take the Minkowski equation as the proper generalization of Newton’s 
equations. 

1Jt is interesting to indicate that, conversely, to a particle with mass mo one 
can associate a plane wave, a so-called matter wave, whose w and k are again ob- 


tained by E = fiw, p = hk. But now & is no longer a light-like vector. From Eq. 2.16 
we obtain 


274 
Sa 
aw? = c2k2 + 


This relation for the de Broglie waves has been verified experimentally. L. de Broglie, 
Annales de physique, 10 serie, 2 (1925), 22. 
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Explicit Forms of the Minkowski Force K+ 


So far we have left the force in our fundamental equation, Eqag2) 10: 
arbitrary, except for the requirement that 


1 
K,u"=0, or K°=-—Kv-v, (2.11') 
c 


This requirement does not determine K uniquely. We now discuss the 
possible forms of K,, some of which are realized in nature. 

Equation 2.11’ implies immediately that some or all components of K 
are velocity-dependent. For example, a constant Newtonian force F corre- 
sponds to a Minkowski force K(v) = F/c(1 — B)'/?, which is increasing 
with velocity with K° = F-v/c?(1 — B?)'/?. Even if K were made constant, 
K° would be velocity-dependent and vice versa. Because u“u, = 1, the 
simplest form of K, is a linear combination of the components of u. For 
this “linear” force, we write 


K,, = AF,,,(x)u’. (2.19) 
The form, Eq. 2.19, will satisfy K,u* = 0 for all values of u" if, and only if, 
F(x) = ~F,,(x); (2.20) 


in particular, F,, may be constant numbers. 

The electromagnetic forces on charged particles belong to this class. 
To see this, let us choose 2 = e/c, where e is the electric charge, and connect 
the antisymmetric tensor F,,,(x) to the components of the electric field E 
and the magnetic field B in the following form: 


OF ££ €&E; OJ, Eyer, 
one O -—B; B, a 0 B; —B, 
0 0 
ees = es (2.21) 
FY = 0 —B, B, 
0 —B, 
0 


Also, we wrote the form F“, and F#” forthe purpose of convenient reference; 
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they follow immediately from F,, with our metric Guy The Minkowski 
equation with Eqs. 2.19; i.e., 


d (mn =) me re 5 
cE of =; boo’, (2.22) 


are then equivalent to the equations 


d e 
a (mv) = eE + aw x B), (2.22’) 
and 
dT 
— 2 = v= 2 = 2 
FF (mc*) = eE-y a (mc* — moc*), 
or, 
me? =e | E-dr. (2.22”) 


Equation 2.22’ gives the Lorentz force on a particle from a given external 
electromagnetic field, and Eq. 2.22” the total energy of the particle to 
which only the electric field contributes. Equation 2.19 indicates in addi- 
tion that F,,,(x) must transform as a tensor under Lorentz transformations 
if K* and u* are four-vectors. That is to say 


F(x) = L", L’, F7°(x). (2.23) 


This gives us the transformation properties of the electric and magnetic 
fields; for example, 


E;(x')= —LjL, F(x), 


and similarly for the other components. Thus, a given external electro- 
magnetic field can be put in covariant form. The covariant form of the 
field equations will be discussed in the next chapter. Here we have con- 
sidered F,,(x) from the point of view of an external force. 
Next we look at the form of the Minkowski force quadratic in 
velocities :7 
K* = Gere, . (2.24) 


To satisfy the condition Ku, = 0, G*”’ must be antisymmetric in any two 
indices. Consequently, it will have 24 independent components. As a 


1 The equation of the motion of a particle in a given gravitational field is of this 
form. Let guy be the gravitational potential. The gravitational field is given by 


Tyo# = 49+#°(ga0,v + Gve.0 — Gva,o); 


then the equation of motion is 
du’ 


at SS af vo#urue, 
ds 
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special case, we can take G*’’ to be antisymmetric in all three indices, 
then it will have only four independent components and can be written 
in the form 

aaa iad Oy: (2.24’) 


where «”’?? is the completely antisymmetric symbol with elements +1 or 
~1, depending on whether (uvop) is an even or odd permutation of 
(0, 1, 2, 3); hence ¢,,, =4!. G, is called the dual of G*”’. 

This process of building up K“ from linear combinations of u" obviously 
can be continued to higher-order tensors. 

Another class of possible force terms can be obtained by noticing that 
(Guy — u,u,)u” = 0. Hence we can put! 


KY = @(x)(gh — uMu,). (2.25) 


Here again K" is determined by four functions (x), as in Eq. 2.24’, in 
contrast to the six independent components of Eq. 2.19. Jn particular, 
$,(x) can be thought of as the gradient of a potential 
10¢ 1 
Ke = 3o ogy ee) =e . 

+759 u"u’) ~ ou", (2.26) 
where we have also added a term proportional to the acceleration, which 
we can do because iu, = 0. We rewrite Eq. 2.26 as 


0p dd d 
Ke = SS eS ee ie fis Se B ’ 
c + Ee u F gu’ =+¢ a (pu). (2.26) 
For this case the Minkowski equation takes the form 
d $0) ewan! 
< || moe + 2 |e i= +¢ ec (2:27) 


In particular, if ¢¢/8x9 =0 in some frame, from Eq. 2.27 we get the 
conservation equation 


[moc? + $(x)]u° = const = E, (2°28) 

which is again our Eqe2.13 ier, 
(x) 
Bee pa (2.28') 


1 Writing Ky = An + uyBruy, we find Ay = — Bu = $y. This procedure can be 
generalized to higher tensors also: Putting KY = Fury, + utueurGoy, we find Fur = 
OAD Gye 

K4 = Fo"(g#oy” — ytysy?), (2.25’) 
and so on, 


Relativistic Dynamics a7 


Thus, the potential defined in Eq. 2.26 can be considered as the rela- 
tivistic generalization of the Newtonian potential. We get a potential that 
enters into the Newtonian energy equation only in the limit v/c > 0. 

Finally, we remark that if we had a constant of motion V(x) along the 
trajectory, i.e., dV/ds = 0, then K, = V,, would be a possible form of the 
Minkowski force, for K,u" = V,,u" = dV/ds = 0. 

We have discussed some of the possible forms of the relativistic force 
and shown how to relate the origin of the force to the coupling of the 
velocity of the particle with external scalar, vector and tensor fields. In 
particular, we have introduced forces due to the external electromagnetic 
and scalar fields. A more general discussion of the coupling between 
particles and fields, which is the main subject of this book, will be given 
in Part II. 


Angular Momentum 


As in classical mechanics, the equations of motion can be expressed 
in terms of angular momentum. The correct generalization of the three- 
dimensional axial vector J=rxp is the second-order antisymmetric 
tensor, as we have seen in Chapter I in the generalization of the curl. 
Hence we define the covariant angular momentum by 


=x) — xP, Phir halal (2.29) 
or, in component form, 


23 
fee), yea-s,, JP=4,, 
and 
x* 
J = x° pk — x*p? = ctmv* — x*me = met(o - *). 


We see, therefore, that the three spatial components of J*” represent 
the total angular momentum of the system J. The remaining three time 
components J do not have a simple physical meaning for a single 
particle. For a linear motion these components clearly vanish. In general, 


we can write 
x* = w*+—, (2.30) 


and associate J*° with the deviation from the linearity of the motion. The 
magnitude of the angular momentum is 


JO'S py = 2x? p? — (xp)?] = — 2% + 2°. 
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We also define a relativistic torque 
N* = x"KY — x’ K*; (2.31) 
then the equations of motion can be written in the form 
Je = NB, (2.32) 


using Eq. 2.7 for the linear momentum. We have here six equations 
instead of the four Minkowski equations, three being connected with the 
new quantities J*°. If N“” =0, we get the conservation of the angular 
momentum J*” = const. However, the equations No, = 0 do not corre- 
spond to any simple physical situation, in general. We shall come back 
to the discussion of the angular momentum in more specific physical 
problems. 


Systems of Colliding Particles 


So far we have considered a single particle in an external force field. 
The invariant formulation of the motion of two or more interacting 
particles is complicated by the fact that each particle will have a different 
proper time. Hence, we must use either a many-proper-time formalism, 
or a single invariant parameter t for all the particles. In the latter case, 
simple relations like u? = 1, p? = mic?, --- will no longer be valid. No 
exact general theory seems to be available. This is one of the reasons for 
the significance and advance of field theory. The action-at-a-distance 
formulation of electrodynamics is an exception and will be discussed in 
Part II. 

However, there is a simple situation of considerable practical interest 
which can be treated partly (apart from the trivial case of noninteracting 
particles). This is the case of collision and emission of a number of particles 
at a single point. If the particles originate from a point, then there is a 
common frame of reference in which all particles are instantaneously at 
rest. In that frame and at that time, vy = 0, but dv/dt #0; hence the time 
component of the force K", K°, is zero, and K = F, all at a given instant. 
Therefore, for K we can use the principle of action and reaction of New- 
tonian mechanics. Thus, summed over the momenta of all the particles, 
we get 


> p,. = const. (2.33) 


This equation is then true in all frames because it is true in one. 
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In a reaction involving incoming and outgoing particles, we have to 
consider the sign of the momentum vectors 


» Pa(initial) = > Pa(final) (2.33’) 


This is the law of the conservation of energy and momentum.! In com- 
ponents, 


> (p2 + m2c?)'/? = (pj + mpe?)'"?;  Y p= Vp. 
a a 8 


Also it will be useful to introduce the center-of-mass frame of two 
(interacting) particles. This is the special frame in which the total momen- 
tum vanishes” 


Pi + P2 = 0; —P2 = Py =P. (2.34) 


The center-of-mass frame momenta have the components 


Pi: (E,, +p), P2 =(E2, —p). 


From any arbitrary frame in which the momenta are p}, p} (say, the 
laboratory frame), one can pass to the center-of-mass frame by a Lorentz 
transformation L defined by 


L(p, + P2) = Pi + P2- 


In calculating energy and momentum relations, it is convenient to 
make use of the fact that the scalar products p,p; of four-vectors are 
Lorentz scalars and can be evaluated in any frame. For example, in the 
reaction a > b + c, we get m? = p? = (p, — p.)* =m? + m? — 2p,p., where 
the last product is equal to m,E, in the frame in which a is at rest. 
Or we can form p? = p? +p? + 2p,p,, where the last product in the center- 
of-mass frame is 2(E,£. + p”), and in the laboratory frame is 


2[ EE. — (P.)(Ps)].- 
Similarly, for the reaction a + b-+c + d one can form invariant rela- 
tions of the form 
(DP, - Py)? a (P. RE Pa)’, 


or products like? 
Pa(Pa + Po) = Pal Pc + Pa): 


1In field theory the conservation of momentum will be related to translational 


invariance. 
2 One can also write a covariant definition, 


pi + p2 = 0, 
instead of Eq. 2.34, which implies Eq. 2.34 but not vice versa. 
3 See Exercise 3. 
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2. Lagrangian Form of the Equations of Motion 


In this and the following sections we shall discuss the generalization 
of the methods of analytical mechanics to the relativistic case, mainly the 
Lagrangian and canonical formalisms for the equations of motion. These 
methods do not use the concept of force and differential dynamical prin- 
ciples (dynamics in the small); instead they use the integral principle for 
the whole of the motion (dynamics in the large). In spite of their formal 
beauty, the integral principles seem to exclude some interesting physical 
situations, such as the frictional forces. 

First, we wish to write a covariant Hamilton’s principle for a single 
particle. Using the proper time introduced in the previous section, the 
action principle would be the following. There exists a functional of L(x*(s), 
x"(s), s) such that the true trajectory x“(s) makes the action integral 


| L(x*(s), X"(s), s) ds 
St 
an extremum. More precisely, for the true motion between S, and s, the 
variation 

S2 

r) | L(x"(s), x“(s), s) ds = 0, (2.35) 

31 
where the variation 6 is so defined that the end points x"(s,) and x*(s,) 
are fixed, and 6, = 0 at these points. However, we cannot write formally 


the Euler-Lagrange equations corresponding to Eq. 2.35 because we have 
the subsidiary condition 


u7(s) = x” =1; 


consequently the variations are restricted. In such situations one can use 
either the method of Lagrange multipliers, or use an arbitrary invariant 
parameter t. We discuss the latter method first. 

The action principle is now 


s| L(x"(z), v"(z), t) dt = 0, (2.36) 
where we have put 


dx*(t) dx" ds ds 
=. 
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to distinguish it from u“, which is always understood to be the velocity 
with respect to the proper time s. We have also the relations 


hence, 


& ie, (v)!/2, and pt = u"(v?))/2, 


It is important to realize that the parameter t in Eq. 2.36 does not 
refer to a specific particular choice t = 1(s), but to any of the totality of 
such parameters. Equation 2.36 is supposed to be valid for any parameter 
t, for if we fix the function t = t(s), we will still get a subsidiary con- 
dition to the action principle which appears to be an additional restriction. 

First, let us calculate the variation of the integrand of Eq. 2.36. We get 


with dt = 0, or 
aL sod [aL d (aL 
as aS = Hh 
aa Fea dt = () [ox ware (= - ), (2.37) 


where we have made use of the relation 


les £ (6) = 5x" (>) 


For the particular variation indicated in Eq. 2.25 or 2.36; i.e., with 
end points fixed and dt = 0, we have 


5§Ldc=f dL dr. 


In the integration of 5L, given by Eq. 2.37, the second term does not 
contribute because 5x“ is fixed and zero at the boundaries: 


OL d [dL 
= iu i 
[ar { [FE - a(S )]ox drt =0. 


And, because 5x” is arbitrary inside the boundaries, we get the equation 


of motion. 
OL da [oL 
eels | (al er) 2.38 

ox" dt (=) ‘ es) 


which must be understood to hold for an arbitrary unspecified parameter t. 
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When the Euler-Lagrange equations, Eq. 2.38, hold, we can write a 


posteriori for the variation of L from Eq. 2.37: 


d (oL 
= a Red 2.39 
6L= F (= Ox’ ), ( ) 


with 6t = 0. 


Examples: (a) A free particle may be defined by the Lagrangian 


d 
Lo = moc(v?)!/2 = moc ae (2.40) 


Then Eq. 2.39 gives 
d vy] 0 
ae |" aa] = 2 
or 


d 
a (mocu*) = 0, 


which is the Minkowski equation for a free particle.! 
(b) Consider the Lagrangian 


L= Lg + Ly = mpe(v?) "2 + : A,(x)v", (2.41) 


where the interaction part of L couples the particle velocity to a (external) 
vector field A,,(x). Eq. 2.39 gives, in this case, 


(v 


d e(GA, 0A, e 
— | (ae ee v = — ve 4 
oF (mocu,) : ( aH ak - Putt (2.42) 


This is the Minkowski equation, with the force term given by Eq. 2.19, 
that we have identified as that of a particle moving in an external electro- 
magnetic field. We also see that the “coupling” in the Lagrangian is in 
terms of the so-called electromagnetic potentials A,(x), and that the 
electromagnetic field strengths are given by the curl of the potential, 


A, OA, 
Ox* = ax" 


F,,(x) = (2.43) 


1 Note that in this example fdr = J moc ds. However, one cannot take moc as 
the Lagrangian, one must also vary s, whereas in our formulation we did not vary 7. 
Both these methods and many others give the same result in simple examples. 
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Writing 
Ate (¢, A), A. : (¢, =A), ‘ (2.44) 


and using Eqs. 2.21, Eq. 2.43 can be written in the more familiar com- 
ponent forms 


10A 
E=-V$-->, B=VxA, (2.43’) 


where @ is the scalar potential and A the vector potential. Note that for 
A = constant number we get a Lorentz force 


1 
K° = —-u Vd, me — Live, 
c c 


discussed earlier, which differs from the pure scalar field of Eq. 2.26. 
(c) As a third example, we consider the Lagrangian 
L=Ly + Ly = [moc + 6(x)](v?)'/2, (2.45) 


which leads to the equation of motion, 2.27, discussed before. 
(d) Another example of a Lagrangian homogeneous of first order in 
velocities and describing the interaction with an external tensor field is 


L= | moe — Pu GR ay | 0), (2.45') 
which leads to the equation of motion 


d d 
as (mocu,) = ds [yy + dy,)u" = oop4 "uu, | = dggqutat. 


Generalized Momenta. Returning to our general equation, Eq. 2.38, 
we define the generalized momenta by 


OL 


Put sae (2.46) 
which gives 
d oL 
eS 2.38’ 
de PH Bye ( ) 


If the Lagrangian does not contain a coordinate x” explicitly, then the 
corresponding component p” of the momentum is independent of t or 
conserved. 
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For the examples discussed before, Eqs. 2.40, 2.41, 2.45, 2.45’, we 
obtain, respectively, 


p* = mycu* = const, 
e 
p* = mocu*® + z A*(x), 


p* = [moc — $(x) Ju", 
p* = mocu, — (dyy + Gy, )u” + Go ,u7uPu*. 


Example (b) shows the difference between the generalized or canonical 
momentum and that used in the Minkowski equation, Eq. 2.7. 


Variation of Endpoints. Consider a Lagrangian not explicitly dependent 
on 7; i.e., L = L(x*(t), v(t)), and a new kind of variation which also 
involves the variation of the end points, 


T2 


ow = 3 Lar =| 6L dt +L dt (2.47) 


Th Th 


v1 


For 6L we use Eq. 2.37, assuming, however, that the equations of motion 
are Satisfied (i.e., for the actual path); then 
OL "2 t2 
bW = Be bx* + L ér| vit) 
dv" 


v%] 


= F(t2) — F(t). (2.48) 
TL 
This equation is important in the discussion of constants of motion; 
namely, if we can find a variation such that W is invariant under it; i.e., 
oW =0, then F(t.) = F(t,), or F(t) has to be a constant of motion. F is 
called the generator of the transformations on W. The generators of 
invariant transformations are constants of motion. 

We now try to introduce formally a Hamiltonian, as in nonrelativistic 
analytical mechanics. Consider a Lagrangian L = L(x", v*), and form 


Using the equations of motion, Eq. 2.38, this equation can be rewritten as 
dL a 
ahd (2), 
dt dt\ év" 
or 
d ab’ _dC 
( v" ) =—=0 (2.49) 
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In the nonrelativistic case, C obtained by this procedure is the Hamiltonian 
and has the interpretation of energy.’ In the relativistic case, energy is 
the zero component of the momentum. We shall see that C can be inter- 
preted formally as the “Hamiltonian” [even though it is not energy and 
its numerical value is zero (see below)] as far as the derivation of the 
Hamilton equations is concerned. But other quantities can also be used 
as the “Hamiltonian” which are related to the energy. 

From our requirement of parameter-invariance, Eq. 2.49 must be true 
for all t. The Lagrangian L has the same form for all t, but different 
numerical values. Thus, we conclude that 


OL 
C=L-v'—=0, 2.50 
Ov" 250) 
which means that L is a function that is homogeneous of the first order in 
velocities v".? We have then the relation 


L(Av*) = AL(v"). (2.50’) 


Now,’ if we write the Lagrangian as a sum of the free-particle 
Lagrangian Ly plus an interaction Lagrangian L,, 


L=Lo+y, (2.51) 


where both Ly and L, are homogeneous of the first order in velocities, 
then the Minkowski force is given by 


aL, dt d (dL; 


eee eee 5 
- ox" ds_ ds (=), CS) 


and automatically satisfies the condition K*u, =0. But the converse of 
this statement is not true. There are examples of K” satisfying K*u, = 0 
(see Sect. 1) that are not homogeneous of the first order in velocities. 
Therefore, we conclude that the requirement of the parameter invariance 
of L invokes an additional restriction on the Minkowski equation, as we 
bave observed before. 


Method of Lagrange multipliers. We now discuss the solution of the 
original variational principle, Eq. 2.35, in the variable s with the subsidiary 


1 We recall that for L = L(q, q), dL/dt = (@L/@q)q + (@L/2q)q, or, with the use of 
the equations of motion, 
“ke Gh HAS » d : dH 
—=—(— — (L — pq) = — = 0. 
qa ai = a), a OS 


2A function f(x) homogeneous of nth order satisfies the differential equation 


nf = x(af/ox), and f(Ax) = A*f(x). 


66 Electrodynamics and Classical Theory of Fields and Particles 


condition u? = 1. Let the Lagrangian L be explicitly independent of s; 
ie., L=L[x*(s), x“(s)]. We introduce a Lagrange multiplier 4/2 and 
use a new Lagrangian L defined by! 


L=L+ ; (u? — 1). (2.53) 


Now the variational principle 6 { Zds =0 leads directly to the Euler- 
Lagrange equations, 


OL d {aL d 
— | ee 2.54 
ox ds (=) am Uy) as) 
The variation with respect to the multiplier A gives, in addition, 
u?—-1=0. (2.55) 


Now we have five equations for five unknowns x"(s) and A(s). Multiplying 
Eq. 2.54 with u” and noting that u? = 1, uu = 0, we find 


haut ut (S) 


OL dx* d (oL 
= SS = Bh ee 
A [i ds aEeoaG ds [. = (=) ds 


or 


— iL _— ut ——— (2.56) 


With this, the final form of the equation of motion is 


qo =2b =(5) d L , OL 
* Ox" = ds\du") — ds ( ait Sa) ta| = 0. ee) 
Th¢ generalized momenta now must be defined by 
ee: 
Pu = oul > 
and are given by 
se OL , OL 
Pu = 37 =F (x —u sa) te (2.58) 


1L. Infeld, Bull. Acad. Pol. Sci., 5 (1957), 491. 
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The equations of motion, Eq. 2.57, are indeed of the form 
Dp, =—. (2.57’) 


One can verify easily that for the three typical examples discussed in 
Eqs. (2.40)ff, the Lagrangians to be used in connection with the equations 
of motion, Eq. 2.57, are 

L = moe, 
for a free particle, 


e 
L=my oc +- A,u", 
c 


particle in an external electromagnetic field, 
L=mge — $(x), (2.59) 


particle in an external scalar field. 
The generalized momenta are exactly the same as before. 


Lagrangian containing the subsidiary condition u? =.1. Finally, we dis- 
cuss the possibility that one can choose a Lagrangian L = L(x", u") in 
such a way that the condition u? = 1 is satisfied automatically by virtue 
of the equations of motion. Neglecting for the moment the condition 
u? = 1, we obtain from Eq. 2.36- 

é6L OL 

rea Lo 2 ties (2.60) 
provided we can choose subsequently the correct Lagrangian to satisfy 
the condition u* = 1. That this can be done is shown by the observation 
that, in the three typical examples considered, had we used the parameter s 
rather than t, we would have obtained the correct equations of motion; 
hence, u? =1 must be automatically satisfied. More precisely the 
Lagrangians 

L=mycu?, 


L = meu? + : Aut, (2.61) 


L= (moc i p)u?, 
give the correct equations of motion, from which we obtain 


q3 ey — 
K*u, = u,u" = 0, 
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or u,u" = const, which can be chosen equal to 1. The point here is to keep 
the u? in L as far as the functional dependence of LZ on u* is concerned.* 


3. Canonical Form of the Equations of Motion 


We recall that in classical mechanics the m Lagrange equations, 


a(s) = pee 


at\aq,) 2a,’ 
for the n coordinate functions q, = q,(t), can be transformed into 2n 
Hamilton equations as follows. One defines the generalized momenta 


and solves this equation for g, =4,(q;, P;, t). If this can be done, one can 
express the Hamiltonian function 


A= p,q,—L 


as a function of q, and p,: H = H(q,, p,, t). We then obtain the canonical 
equations 


Gp, So 
qk : OD, F 


In this section we discuss the corresponding equations for the relativistic 
case. 

We have already introduced the generalized momenta and noticed 
that a straightforward construction of the “Hamiltonian” for a single 
particle from the Lagrangian leads to an expression that is identically zero. 


H = u"p,-L=0. (2.62) 


However, we can still use # to derive the canonical equations by dif- 
ferentiating with respect to its functional dependence. For this purpose # 
must be expressed as a function of x" and p", which we now take as 
independent variables. 


H = W(x", p*). ( 2.62’) 


1A further form of the action principle as an integral in four-dimensional space 
will be discussed in detail in Part II. This form will be suitable when we consider par- 
ticles and fields as two physical systems in their own right and in their mutual interaction. 
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Using the equations of motion p, = @L/dx", we obtain the canonical 
equations formally from Eq. 2.62: 


(2.63) 


For a general Lagrangian combining our three typical examples; i.e., 
L=[moc — $(x)]u? + (e/c)A,u", we get 


H = u"p, — [moc — O(x)]u? — ~ Ayu", 


which must be considered as expressed as a function of x” and p,. Then 


00? edA, , OH : 


a. py AG ie Wat: gs a 
Bor Pat ~ eax ~~ Pas dp* aad 


or 

Although this method is formally correct, one can argue against it on 
two grounds; first, # is identically vanishing, and secondly, it does not 
have the meaning of energy. To avoid the first difficulty, one can use 
instead of #, the symmetric tensor 


HH” = p’u® — gL, (2.64) 
with the canonical equations 


aH" oH" 
ae ee “agg (2.65) 


To obtain a Hamiltonian that has the meaning of energy, we must 
proceed differently. 

We have seen that, in general, the canonical momentum p* consists 
of a kinetic term mocu", the momentum of a free particle, plus a term 
independent of the rest mass mp; 1.€., 


p* = mocu" + f*(o:, 4), (2.66) 


where f“ is a function of the external fields and velocities. Because u?=1, 
we get 


(p —f)’ = moc’, (2.67) 
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which generalizes the free-particle equation p? = moc”, Eq. 2.9.1 There- 
fore, the quantity (p" — f“) is the relevant and important quantity as far 
as the motion is concerned. The equation of motion can be written in 
the form 


5 ~F) =F (gout) = = = Ke (2.68) 
In this form it is the same as the Minkowski equation (Eq. 2.10), written 
now in terms of the canonical momentum p“, whereas in Eq. 2.10 we 
wrote the mcu" as the kinetic momentum p“. Now, of course, the energy 
and momentum of the particle will be different from that of a free particle. 
The energy of the interacting particle is obtained from Eq. 2.67 to be 
(assuming f is independent of velocities) 
E=cp° =f + c[(p—f)? + m2c?]!/? = H. (2.69) 
If f* depends on u“, as in the example of the tensor field, Eq. 2.45’, we 
have to re-express u“ in terms of the p" in Eq. 2.67. Now E in Eq. 2.69 
can be used as the Hamiltonian to write the canonical equations as in 
classical mechanics, 
ony aaa 
Op =X, a UE 
Equations 2.70 are not covariant, but they are perfectly relativistically 
invariant. We have already pointed out that covariance is not necessary 
(it is sufficient) for the relativistic invariance. Although H does not trans- 
form as a scalar, vector, tensor or spinor, nevertheless it has a definite trans- 
formation property, and Eqs. 2.70 can be written down in every frame. 
H transforms like the zero component of a four-vector. 
We leave the discussion of our examples from the point of view of 
Eqs. 2.70 to the exercises at the end of this chapter. We note here that 
Eq. 2.69 for small v/c becomes 


(2.70) 


il 1 
E> my0*= Ey, = of (= ae ft) 2 eee ee 
of nr = Cf 2mo (p — f) 8m 302 (p — f)* + . (2.71) 


1 In the case of a scalar external field, we find, from p# = mocu# — gut, 


i 


ye = p# . 


moc — ¢ 
hence 
¢ 2 
27] —— en 202 
pe( + 2) mo*c?, 
or 


p® = p® — p? = (moc — $)®; EE = clp® + (moc — ¢)2]2, 
We also find that 
(2 + $= moc. 
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For example, in the case of an external electromagnetic field, f* = (e/c)A", 
and we get the more familiar equations 


aa 1/2 
E=ep+ e|(p ap A) + mge?] : (2.72) 
1 gee \ 4 1 et 
E,,= —— — a = __. = Sse 
wR = eG + Wi, (p 5 A) mee (p a A) + . (2.73) 


It is still possible to write covariant canonical equations with a 
“Hamiltonian” that is a Lorentz scalar and that is related to the rest 
energy of the particle. Consider the following two scalars: 


: 
M — — 2 — 2 
Tagg (PLY? = does (2.74) 
M’ = [(p —f)*]"? = moc, (2.75) 
where f is independent of u”. From Eq. 2.74 and Eq. 2.66, we find 
OM 1 F 
ap meee —Sf,) = Uy = 3 
0M 


lt aes ‘ 
Ox" ae a Moc (p — Tata = —uUfio = — Pu; 
i.e., correct canonical equations. Similar equations hold for M’. From 
M’"? =(p—/f)?’, we get 


OM’ aM’ 
: Axe =(p"-—f hws Mie = (p, — Sy); (2.76) 


using the value of M’ = moc, we get back the previous canonical equations. 
With respect to M’, we note that in going over to quantum theory 
the operation of taking the square root of the operators is not straight- 
forward and leads to the introduction of an additional finite-dimensional 
internal space, as in the case of the Dirac theory. 
For a scalar field, the functions M and M’ are given by 


M 


= fp)*2 4 6 sige; «MM =) 4 = Me, 2.77) 
2moc 


respectively. 

To summarize our considerations on Lagrangian and canonical forms, 
we can state that there are a multitude of forms in both cases that are 
essentially equivalent, and that the problem of writing relativistic 
Lagrangian and canonical equations poses no great difficulty. 
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Also, the development of canonical transformations, Poisson brackets, 
and the Hamilton-Jacoby theory can be carried over formally to the 
relativistic case with any of the “Hamiltonian” functions we have discussed 
above. For completeness, we give a brief discussion of these topics. 

If one uses the noncovariant relativistic Hamiltonian, Eq. 2.69, the 
discussion is exactly the same as in the case of analytical mechanics and 
need not be repeated here. Therefore, we shall comment on the covariant 
forms only. 

Consider our scalar Hamiltonian M or M’, or H, which we denote by # 
in the following, meaning any one of them. Introducing an eight-dimen- 
sional phase space (p”, x“) the canonical transformations may be defined 
by the equation 

Dv" — # = piv'* — #' + =, (2.78) 
_ where F is the generator of the canonical transformation. Choosing, for 
example, F = F(x", x’, s) (there are three other forms possible, F must be 
a function of one set of all coordinates and one set of new coordinates), 
we obtain 


OF 
HP ee 
+ 35° 
_ OF 
Pu = Bye (2.79) 
; OF 
ba 


The Poisson bracket of two functionals of x" and p* will be defined 
formally as 


P, Ole 
LP, 2] Ox" cp, Op" ax, ee) 


If P and Q are scalar functions, [P, Q] is invariant under Lorentz trans- 
formations, and it has all the usual properties of the Poisson brackets. 
In particular, using Eq. 2.63, we find 


[o", H] =x"; -° [p*, H] = p*. (2.81) 


For a general scalar function P(p", x“), we obtain 
dP 
P; =—; 
[P, H] i (2.82) 


? See, for example, H. Goldstein, Op. cit., Chapter 8. 
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if P also depends explicitly on s, we have the more general formula 


ap oP : 
(Pet le 7s t aoe (2.83) 
Therefore, quantities independent of the proper time s commute with the 
Hamiltonian. 

Thus, we see that the general theory of canonical transformations and 
Poisson brackets—as well as the theory of infinitesimal canonical trans- 
formations and Hamilton-Jacoby theory (see Exercise 9)—can be put in 
covariant form for a single particle. As we have noted before, in the case 
of several particles, we cannot use a common proper time. Consequently, 
no exact relativistic theory of several interacting particles can be given by 
the methods of the present chapter. (See Chapter VI.) 


4. Electric and Magnetic Moments: Classical Spin 


In this section we shall discuss some new intrinsic properties of our 
classical relativistic mass point which manifest themselves by additional 
terms in the equation of motion. So far we have only considered a mass 
point completely characterized by its rest mass in a given external scalar, 
vector or tensor field. A spinor field is included in the sense that, in general, 
tensors can be constructed out of spinors. The spinor formulation of 
mechanics itself will be discussed in the next section. 

We start with the case of an external electromagnetic field. If the mass 
point has an intrinsic magnetic moment yp (not the orbital magnetic 
moment), then it has, in an external magnetic field B, an additional energy, 
p-B. Similarly, a particle with an intrinsic dipole moment d has an addi- 
tional energy d-E in an external electric field E. As a matter of fact, 
these moments may be defined phenomenologically by the form of the 
energy. The covariant generalization of these terms in the energy can be 
written as 


as em (2.84) 
where o*” is an antisymmetric tensor: 
0 d; d, d, 
ot” = —d; 0 430 —H2 : (2.85) 
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In the same way as electric and magnetic fields form a unity F,,, electric 
and magnetic moments form a unity. In particular, these are transformed 
into each other under Lorentz transformations. 

Because the moments are not scalars but vectors in the rest frame of 
the particle, their orientation will depend on the motion and, in general, 
will change along the trajectory. We are faced then with two problems: 

(a) Determination of the motion in the presence of the moments; 

(b) Determination of the precessions of the moments. 

We solve the problem (a) first for a constant o,,. We may start con- 
veniently from a Lagrangian, which must contain the additional term, 
Eq. 2.84: 


L = (moc + 40""F,,,)(v2)"/? + Av", (2.86) 
According to Eq. 2.38 this Lagrangian leads to the equation of motion 
d e d 
ds (mocu,) = - rye + to" F 5, = as (4o"*F,.u,). 


The last term still contains a factor du/ds, so the equation is not yet in a 
form to allow a physical interpretation. Rewriting the last term, 


du, 


d a v 4 
= Go Fu.) = ju,u,oF%, + to" F,, seteas 


and grouping terms we get 
d 1 e F y 
i (mocu,) = Am 7 Fav + fo" Fp — Ju,uo"F3), (2.87) 


where 


Am = ae oF : 
The effect of the moment terms is therefore twofold: one, to change 
effectively the mass or the charge of the particle [the factor 1/(1 + Am)}; 
and two, to add forces which depend on the gradient of the external field. 


Even for a homogeneous field, we get a deviation from the Lorentz force: 


(2 


d 
F#e = 0: — = —__ _ 
= ds (mocu,) 1+Amc 


Fite. (2.88) 
If the energy of the moments to" F,, is very small compared with the rest 
energy of the particle, Am is also very small and the Lorentz equation 
holds. 
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The above equations are not very realistic, because o”” in general is a 
function of s and we must consider the problem of the precession of the 
moments. We can use either the tensor o””, or a vector S“ dual to o”” and 
defined by 


Ouy = dEyvepPS’, (2.89) 
where p” is the momentum vector, or 
Shsahe’? pio... (2.90) 
From Eq. 2.89 and Eq. 2.90, we obtain the relations 
p,S" = 0, p,o"” = 0 = S,o"", Cuy = —Oy,. (2.91) 


Thus, spin may also be described by a four-vector orthogonal to the 
momentum p. In the rest frame of the particle p:(m, 0, 0,0), the spin 
vector S is a three-dimensional space-like vector (0, S). 

During the motion of the particle, S* or o,, will change. There are 
various methods of deriving an equation of motion for these quantities 
as a function of proper time, from which the precession of the spin follows 
immediately. Of course, the motion of the spin is such that Eqs. 2.91 are 
valid at all times. 

One simple method is to start from the equation of motion in the rest 
frame of the particle.’ In this frame, the equation of motion in external 
electromagnetic fields (E, B) is given by 
ds g\e f\e 
—= (Z)<sxe+(Z)“sxe, (2.92) 
where g and f measure the magnetic and the electric moments. The 
moments themselves are given by 


n= (2)<s: e=(Z)<s. 7 (2.93) 


We also note that mw and p as the spatial part of a vector, or as com- 
ponents of an antisymmetric tensor, transform as follows: 
j v'H ¥ : 
Mi a ae alt 
(2.94) 
F v°p i 
Pe ta) pememet vB 


where 
y= py". 


1. Bargmann, L. Michel, and V. Telegdi, Phys. Rev. Letters, 2 (1959), 435. This 
reference deals with homogeneous fields only. 
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The unique generalization of Eq. 2.92 to any frame is. 


58 (2) < LF u, + (S,F?u,)u"] 
2] m 


d v 
+ (5) [F*Y'u, + (S,F%*u,)u“] ——— Su", (2.95) 
2 dt 
where F%, is the dual of F,, and is-obtained by replacing E by +B and 
B by —E, or 

Fi, tee (2.96) 


In Eq. 2.95 the factor du"/dt in the last term is given by the equation 
of motion, Eq. 2.87, which we have determined previously. If we sub- 
stitute this equation into Eq. 2.95, we obtain the final result: 


pi 
dS* ad € |(g)Frs, dL (Z)rrs, at (3 _ 1 ) Sor ,ut 


dt m{\2 Z 2 1+Am 
ee ee So" F2,u" 
= FS Sore ae) — ae 
oe yt5 ree 2mc?(1 + Am) mee) 


.A simpler formula is obtained in the special case of homogeneous fields 
and, neglecting Am, 


dS" * . 
aia E | dae (2 = 1) 5 FePugut + D eae. = Sel usu , 


(2.98) 


This is the result obtained by Bargmann, Michel and Telegdi, referred to 
above. 

Next we discuss a Hamiltonian form of the equation of motion of the 
spin. Consider the equation of motion 
dS ([g\e 
—=(=|)—SxB 
7 (2)<sx (2.99) 
of a magnetic moment in the rest frame of the particle. Because the 
magnitude of S, S, is constant, there are only two degrees of freedom in 
the problem. We choose 


¢=S,, and 7 =tan~*(S,/S,) (2.100) 


as the canonical variables and a Hamiltonian, 


eg eg : 
H=—S§8 B= — [(S2 — £2)1/2 é 
5 S‘B = [(S* — €*)"?(B, sin y + B, cos n) + €B,]. (2.101). 
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Then the equations of motion, Eq. 2.99, are equivalent to the canonical 
equations, 


mc” oe —é. (2.102) 


More generally, we can take the magnitude S also as one of the 


variables and introduce a variable ¢ conjugate to S. Because S is constant, 
we have the additional equations 


dH dH 


The spin Lagrangian is given by 
oH oH 
Lin Hitgiinee = —H +¢ + S—_=0, (2.104) 


which is identically zero, but still allows one to derive the equations of 
motion. 

The most satisfactory description of classical spin is in terms of a 
_Lagrangian that simultaneously gives, both the equation of the motion 
for the trajectory, and for the spin. This cannot be achieved by a Lag- 
rangian of the type Eq. 2.86 because it does not contain ‘“‘coordinates” 
or “internal degrees of freedom” corresponding to spin whose variation 
would give the spin equation in the same way as the variation of the space 
time coordinates leads to the equation of the motion. We present now in 
this final paragraph a general method, independent of any specific inter- 
action model, to obtain both ae spin and momentum equations of the 
motion.’ 

Let us introduce as new internal coordinates four fourvectors 


Ga(% = 0, 1, 2, 3) 
with qo in the direction of the fourvelocity u, and the other three ortho- 
gonal to u,, hence space-like. Consider a Lagrangian that is a function 
of x, g, q and the electromagnetic field: _—_— 
L= Llu Qa)» Yay» A(X); Ayv)]- (2.105) 
The variation of this Lagrangian leads to 


oL d /{éL oL d{oéL 
Oh ieee) OE dq 
= Ee ds =(=)| ates = ds Gry | ae 


d foL OL d oL OL 
Set Oe ee a ren a —— (6A, — A, , dx" 
a be aay OX tama + ae bat] + 57 ( a ie x") 
— A, ,, Ox"). 
+o 2 Avie Xe”) 


1] should like to thank James Currin for an enlightening discussion of this point. 
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The first two expressions in brackets are just the Euler-Lagrange equations 
for x and q respectively, and these will vanish. 
If we now consider the translations 


d 
Oxa—laie a one (), OA, = 69(a) = 9, 


we obtain the equation of the motion 


OL 6L 
D —A —A 2.106 
Pu 0A, Ayu + Can Avy ( ) 
where 
Py = OL/Ox". 


Next we consider the homogeneous Lorentz transformations 
6x* = exe OA, = 6,’A, ’ 6A, = by Ans + £,°A;.» F 


. O(a) aa 89a) ? 
and obtain! 


Bea) 6, ~ ; 
ce ag, ™ + Pip} 


= A oe A‘; ea ae Aj.) . (2.107) 
Hu, [nu 


Using Eq. 2.106 we may rewrite Eq. 2.107 as follows 
Say + Pray) = Sten — MpPAryp = M4, Ary 
where 


oL 
Sa (a) 
uy agit, qQyj 


lit 


i, = OL/aA" 
M*? = —M?* = OL /6A,, 


1 We use in the following a bracket around the subscripts for an antisymmetric 
tensor formed out of two vectors: ajubs; = auby — Grby. 
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If the Lagrangian depends on the derivatives of A, only through the 
electromagnetic field tensor F,, = —F,,, M,, is antisymmetric and we can 
write 


Suv + Pty) = —jtAv — My PF yp - 


Now we assume as usual that the dependence of L upon A 1 IS given by 
(See also Chapters III and IV) 


Defining a mechanical momentum G, by 


G, = Pp, — eA, 
we obtain 
G, = eF ,,u* + 4MF,, , 
Sa + Gpytty = Mok vf 
To solve for the mechanical moment we multiply the above equations by 


u” and obtain 
Gi Miolat uS,, —u Ma’, 


where we have put 


My = —u’G,, 
then 
Syy + U* Stall; — UME Fey) 
+ U°M?F yoy) = MEF vip, 
or finally 
Syy + US ally = Tyy + U’ Tiyetly» —~ (2,108) 
where 


T= MEF vip - 


We notice that the previous equation is reduced to an identity upon 
multiplication by u,. This means that the six components of S,, are not 
all determined. However, because we assumed a Lagrangian independent 
of the accelerations u, the so-called Weysenhoff condition 


S,u" =0 (2.109) 


is automatically satisfied. Now the three spatial components of spin are 


_ 3 1303 00076 3830 


80 Electrodynamics and Classical Theory of Fields and Particles 


determined. We may further assume that the magnetic moment M,, is 
proportional to the spin 


M,, = ™S,,y 


One may now derive from Eq. 2.108, as a special case, the equation 
of Bargmann, Michel and Telegdi discussed above and the fact that 
(mo + o”’F,,) is a constant of the motion. 


5. Further Developments and Exercises 


1. Consider relativistic energy conservation in nuclear reactions and 
verify the relations of Eq. 2.13 to Eq. 2.17. 

2. Derive formulae connecting laboratory and center-of-mass frame 
energies and momenta, as well as scattering angles for two-body reactions. 

3. By analogy to the photon define the neutrino as a mass-zero rela- 
tivistic particle associated with the plane-wave solutions of the spinor- 
wave equation. 

4. Solve the equation of motion, Eq. 2.22, of an electron moving in a 
fixed electromagnetic field in the simple case of constant electric and 
magnetic fields. (L. Landau and E. Lifshitz, Classical Theory of Fields, 
Addison-Wesley, Cambridge, Mass. (1951), p. 48-56.) 

5. Solve the relativistic Kepler problem. 


6. A solution for a relativistic particle in an external scalar field. Con- 
sider the Lagrangian, Eq. 2.45, or the equations of motion, Eq. 2.27. 
Assume that in a particular frame ¢ does not depend on x° (note that i) 
is a function of x”); then 


dx°® 
(moc* — ¢) —— = const. 
ds 
Choose ¢ = ¢(r) = g’/r and the constant equal to moCc?: 
g? 
Show that if one chooses a + sign for r>ry anda — sign for r<ro, 
one obtains a motion such that a particle approaches a force center until 


r= ro/2, returns, and goes back to infinity. [J. Werle, Bull. Acad. Pol. 1, 
(1953) 281; ibid. 5, (1957) 491]. 
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7. Spinor Mechanics. Introduce two component spinors € by the 
relation 
a! ax _. ra 
=e = Co ay 
ds Ga Ng 
where the right-hand side is a four-vector under proper Lorentz trans- 
formations. The free Lagrangian, with the Lagrange multipliers, 
(= - dn, 


CSS op Xn 
o=5(Go Ma — Se Te) + Ani ¢o"n), 


leads to the equations of motion, 


dé ». an 

ee teAaGO”; 7 ae aL 
adx* di* 
_—_ = tad —_ = 

ds son ds 


If A is any 2 x 2 matrix, we get, with [A, B] = AB — BA, 
“ (An) = A,éLo", A} 
ds 4 ion Ag > y. 


If, for convenience, we wish to have parity conservation also, in the 
above equations we can replace €, 7 by four component spinors and o” 
by the Dirac matrices y“[A. Proca, J. Phys. Radium 15, (1954) 5], with 
yy” + y’y4 = 2g"”. The spin of the classical particle can be described by 
taking 

A= RY — Vyas 
The interaction Lagrangian can be taken to be of the form 
Lin = cOn, ee 


where for electromagnetic interactions O = eA, y*. Show that the Lorentz 
force can be derived from this equation. 


8. Thomas Precession. Consider an electron moving in a circular orbit. 
In the equation of motion of a spin in a rest frame, Eq. 2.92, without the 


electric moment term, 
dS g\e 
—={-|}—S xB, 
dt (3) m 


the magnetic field refers to that in the rest frame. Let the electric and 
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magnetic fields in the laboratory frame be E’ and B’ respectively. We 
must either transform E’ and B’ into a rotating frame of reference, or, if 
we transform them by a proper Lorentz transformation 


Be (p-* xr), 
c 


we must keep in mind that now we have a “rest frame” that is rotating. 
In the latter case, the equation of motion of the spin is 
dS (= 


) —o,xS, 
dt nonrotating 


where wy is the angular frequency of the Thomas precession, to be ob- 
tained as follows: Consider an electron in a circular path at two neigh- 
boring points P and P’, with velocities v and v + dv at these points. The 
laboratory frame is related to the rest frame at P by a Lorentz transfor- 
mation with velocity v, and to the rest frame at P’ by a Lorentz trans- 
formation with velocity vy + dv. The two rest frames at P and P’ are 
related by a Lorentz transformation plus a rotation, and is of the form 

x” =x’ +x’ x AQ; 
therefore, 

1 vxa_ 1 

OQ; = lap -}| Puma 

9. Discuss infinitesimal canonical transformations in covariant form 
for a single particle and the relation of their generators to the constants 
of motion. 

10. Write the covariant Hamilton-Jacoby equation for a single particle. 

11. Derive Eqs. 2.102, 2.103, and 2.104. 

12. Consider a system of particles, and let f(p)d°p be the number of 
particles with momentum in the range p and p + dp. Show that /(p)E is 
an invariant quantity under Lorentz transformations. Here E is the energy 
of the particle. Hint: Use the invariance of d*p and the relation 


d*p 5(p? — m?) = d?p(m/E) 


13. Show that for two colliding monochromatic beams of particles the 
invariant current density is given by 


j= (ky k.y - mim3]'/?/mym, 


where k, and k, are the four-momenta of the particles and m, and m, 
their masses. Hint: Take a special frame in which the particles are moving 
towards each other along a straight line. In this frame the current density 
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is j = (v, + v2)/V,V,, where v; (i = 1, 2) are the velocities of the particles, 
v; = p;/E; (p is the magnitude of the three-momentum, E the energy) and 
V; = m,/E; are the invariant normalization volume. 

14. Extend problems 12 and 13 to the case of mass zero particles. 

15. Let P be a point on the world line of a particle. Introduce at this 
point an orthogonal basis (Vierbein or fourleg) 


i = g*? 
(2); 
Gri uy, 


where u, is the fourvelocity of the particle. Show that under the equation 
of motion 

du’ 

ds 


eT) = (i) 
Qe, = u,e, 


ds 
e(s) = u,(s) 


the basis remains an orthogonal basis along the world line of the particle. 
16. The so-called Fermi Coordinates of a point A with respect to a 
world line with the velocity vector u,(s) are defined as follows: 


sane” 
x = 5 


where d is the distance of A from the worldline (distance of the geodesic in 
the case of general relativity) and e™ is the basis introduced in the previous 
problem, n” the normal to the world line at the intersection of the normal 
to the point A, and s is the length of the world line from some origin P 
to the intersection point. Show that these coordinates are invariant. 
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II. 


jo ete 
FIELD THEORY 


1. Intuitive Introduction of Fields 


We have already met the concept of fie/d in a limited sense, namely as 
an external-force field on a particle. We have spoken of given external 
scalar or electromagnetic fields, producing an “‘action-at-a-distance”’ type 
of force on the particle. More fundamentally, a field is a mechanical 
system in its own right, carries energy and has, in general, a continuously 
infinite number of degrees of freedom. It is a system filling the whole space 
like a “‘fluid’”’, and it has physical reality. Whereas in the previous chapter 
the mass point was the physical system under consideration and the fields 
were merely auxiliary (phenomenological) quantities, in this chapter the 
fields will be our physical system, and we shall describe and seek the laws 
obeyed by the fields themselves. The more realistic (classical) description 
of the physical systems must consider the interacting fields and particles 
together. This situation will be discussed in the second part of the 
book. However, if the interaction with the particles is weak, the laws 
of the so-called free fields can be observed directly, as in the case of a 
radiation field, for example. 

The infinite number of degrees of freedom of the field must be described 
by continuous indices. Instead of the coordinates 9,, g2, °** , the dynamical 
variables of the field will be a set of functions W°(x, t), «=1,---, N, 
where (x, f) are now parameters which, together with a, label the degrees 
of freedom of the system. It is then possible to formulate a completely 
relativistic and elegant theory. However, some divergence difficulties are 
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present with a continuously infinite number of degrees of freedom. AI- 
though it is rather straightforward to formulate a relativistic theory for a 
single particle or, at the other extreme, for a system with an infinite number 
of degrees of freedom, it seems very difficult to formulate a relativistic 
theory for a finite number of particles. 

We first illustrate the passage from a mechanical system to a field in 
terms of a simple one-dimensional model. Consider a linear chain of mass 
points, and let w; be the displacement of the i‘ particle from its equili- 
brium position, and let a be the equilibrium separation between the mass 
points. Let us assume that forces acting on each particle come from the 
two neighboring particles and that the effect of the remaining mass points 
is negligible. For simplicity we consider the force between two mass points 
to be proportional to their separations. The force on the i‘ particle is, 


thus, 


OV 
F,= — av, = KWis. - Wi) = ky; = Wi-1)- 


The Lagrangian of the total system is given by 
a y [mb? — kis, — W)71, 


where the dots on the dynamical variables y; denote time derivatives. We 
have a discrete but infinite number of degrees of freedom, and our treat- 
ment at present is nonrelativistic. For each mass point i the above 
Lagrangian yields the equation of motion 


mi — Kiar — Wi) + KW; — Wi-1) = 0. 
Now we go to the limit a 0 in such a way that m,/a approaches a 


finite value p(x). The discrete index i becomes a continuous index x. Then 
we have the following limiting relations: 


Wilt) > WC, 1), 
Wis Vii vi, oy 
a ie : 
«(Ha Wi Wi- Wi moet) ay 
a a a 0x2’ 
kay, 


where Y is a constant, namely Young’s modulus, because it is the propor- 
tionality constant between the force and the displacement per unit length: 


rasa 
a 
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The limit of the Lagrangian is, then, 


L= i ax| |v? - (4) - | axee, ') ae 


where we have introduced a Lagrangian density L(x, t). In this limit the 
action principle 5 | L dt = 0 becomes 


6) dxdt =0. 


Instead of the variation of the coordinates w;, we have now a variation 
of the function w(x, t). In mathematical language we have here a varia- 
tional problem in two variables (x and t). The mathematical theory of the 
variational problem in several variables and for several functions is 
standard and very straightforward.! 

The limit of the equation of motion is 
lel 
weir “Ava 
which is a wave equation with a propagation velocity v =./Y/p. Here 
W(x, ft) represents the amplitude of the elastic waves. The wave equation 
coincides with the Euler-Lagrange equations, 


(@)-a le) -4° 
at\ep) dx \ay*]) sy” 


where f’* =0y//0x, which itself is the limit of the Euler-Lagrange equations 
for the discrete system. 

Generalizing this example, we see that we can define the dynamics of 
the field by an action principle 

5j Y d*x dt =0, 
where the Lagrangian density Y is a function of the field variables 
W(x, t), «=1,-+-, N, and their derivatives. It is clear that if Y is a 
Lorentz scalar, the action principle will take an invariant form~ — 

6) £ d*x =0, (3.1) 
and the resultant field equations will be invariant under Lorentz trans- 
formations. It is this fact that makes possible a relativistic theory of a 
system with continuously infinite degrees of freedom. 

Before we elaborate on the action principle, Eq. 3.1, we introduce in 
the next section the prototype and the most important of all fields, namely 
the electromagnetic field. We discuss inductively the basic equations and 
demonstrate their relativistic invariance. 


py — =0, 


1 See, for example, R. Courant and D. Hilbert, Methods of Mathematical Physics 
(New York: Interscience Publishers, 1953), Vol. I. 
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2. The Electromagnetic Field 


The description of the interaction between charged particles by the 
intermediary of a field as an energy-carrying physical system goes back 
to Faraday and Maxwell. The field is produced by the charged particles— 
whose existence we assume as the starting point—and is measured by the 
acceleration that it produces when acting (again) on charged particles. 
The concept of the electromagnetic field is perhaps best introduced 
deductively by its basic equations, Maxwell’s equations and the auxiliary 
equations. However, we shall proceed in a slightly inductive manner. 


Basic Equations’ 


Charged particles at rest are acted on by a force eE, where e is an 
attribute of the particle, its charge.? Therefore the electric field E may be 
introduced as the electric force per unit charge in the same way as the 
gravitational field g is defined to be the gravitational force per unit mass. 
Similarly, the magnetic field B may be formally introduced as the force 
per unit magnetic pole. However, the meaning of the magnetic monopoles 
is not very clear, so we go back to the general expression for the force 
acting on a moving particle discussed in Chapter II. If the charged particle 
is in motion, the force acting on it is given by 


F=e(E+v xB). (322) 
This equation may be taken as the definition of the electric and magnetic 
fields. We also know from the conservation of energy that E = —V¢@, or 


V x E=0, and that the magnetic field B does not do any work. We 
emphasized in Chapter II that E and B are different aspects of the same 
thing, the electromagnetic field F,,. 

In material media the existence of many charged particles, and of 
atomic and nuclear structures, makes a continuous description difficult. 
This model of matter is a quantum-mechanical one. Nevertheless, one 
can introduce, phenomenologically, two new quantities D defined by 


p D-df = Q, (3.3) 
F 


1 In this section we use rationalized mks units, the four fundamental units length 
! (meter), mass m (kilogram), time ¢ (second) and charge g (coulomb) and Eoz0 = 1/c?. 
Of course, the number of fundamental units or the choice of them is arbitrary and has 
no fundamental significance. (Later on we shall find it convenient to put c = 1.) 

° For the interpretation of e as the coupling constant between the charged particle 
and the electromagnetic field, see Part II. 
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where Q is the total charge inside the closed surface F, and H = pole 
strength/unit area, Presently we shall give alternate expressions and inter- 
pretations for these quantities. 
The fundamental postulates of SieoSANiics in material media 
(Maxwell’s electrodynamics) are: 
(1) Faraday’s law, 
A i df = -$ E-ds ; (3.4) 
dt Cc . 


(2) Ampére’s law, 


| C-df x H-ds, (3.5) 
F (a 4 


where C = J + dD/dt, and J is the current vector. 

Faraday’s law is a kind of relativity effect with respect to the motion. 
If we move a closed circuit in a magnetic field with a velocity v, the charges 
will be acted on by a force v x B. On the other hand, if we hold the circuit 
fixed and move the “magnets,” the same effect should occur. But now, 
because the charges are at rest, they must be acted on by an electric 
force eE. The equality of these two effects is expressed by Eq. 3.4.1 

In particular, for closed surfaces Eqs. 3.4 and 3.5 become 


§ B-df = constant in time; 
§ C-df = 0. (3.6) 


The second of these equations indicates that the electric current lines are 
closed. Because B is finite and the space is homogeneous, the same must 
hold for magnetic lines in agreement with observation; i.e., 


§ B-df = 0. (3.7) 
From Eqs. 3.6 and 3.3, we obtain 


“§ D-df + § J-df=0, 


or 
dQ 
‘dt 


This is the equation of continuity, which implies that if J = 0, the total 
charge Q is constant. One can furthermore introduce a charge density p 


defined by 
[p-a-| pdy, (3.9) 
ie LA 


+ §J-df =0. (3.8) 


1 See Exercise 1. 
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or, by Gauss’s theorem, 
p=V'D; (3.9’) 
then the equation of continuity, Eq. 3.8, can be written in differential 
form as 
p6+V-J=0. (3.10) 
The three basic equations, Eqs. 3.4, 3.5, and 3.7 can also be written 
in differential form: 


B=-VxE; 
C=D+J=VxH; (3.11) 
V-B=0. 


However, these seven equations are not sufficient to determine the electro- 
magnetic field (E, B, D, H) for a given J. One has then to make use of 
empirical or phenomenological and approximate relations. 

Empirically, charges in free space and at rest produce an electric field 
given by 


ae i 
E =— | —rd? : 
ale max, (3.12) 
from which we obtain 
1 
V-E=— p. (3.13) 
0) 


Similarly, stationary currents produce a magnetic field given by 
Bax a § (I x /r) dx, (3.14) 


which leads to the relation 
V x B= pod. (3.15) 


True to their empirical character, Eqs. 3.13 and 3.15 contain two new 
empirical constants &) and pig. So far we have used charges and currents 
in free space. If matter is present, charges and currents that make up 
matter will be influenced by external electromagnetic fields. The charges 
will be displaced from their equilibrium position, giving rise to electric 
moments and polarization. The polarization P is defined as the electric 
moment per unit volume. The displaced charges will produce an additional 
electric field whose effective density is given by 


1 
—p = =VeP. (3.16) 
Eo 
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Therefore, in matter, instead of Eq. 3.13, we will have the relation! 


1 
V-E=—(p+p’), 
£0 
or 
p = V-(eoE + P) = V:(eE). (3.17) 


If we compare Eq. 3.17 with Eq. 3.9’, we get the first of our empirical 
relations 
D=e.E+P=cE. (3.18) 


This equation defines the dielectric constant of matter in terms of &) and P. 
Similarly, the circular orbits of the charges inside matter constitute a 
magnetic moment M, which can be related to an effective new current by 


J’=V x M. (3.19) 
Therefore, in matter, instead of Eq. 3.15, we have the relation 
Vx B=p,(3 + J’), 


or 
1 
J=Vx (—B-m). (3.20) 
Ho 
We define a new quantity 
1 1 
H=—B-—-M=-B, (3.21) 
Ho B 


which we introduced before without much elaborating. Here py is the 
magnetic permeability of matter. We have, then, 


Vex 


if, in addition to the stationary currents J, we have displacement currents 
given by D, we obtain the second of the fundamental equations, Eq. 3.11, 
or Ampére’s law. At the same time, Eq. 3.21 is the second of our empirical 
relations. 

It is important to note that an essentially quantum-mechanical model 
of matter, with its bound and circulating charges, was used to derive the 
above empirical relations. A mixture of positive or negative charges, as 
in a plasma, would behave quite differently. Any further investigation of 
D and H, and hence ¢ and yp, must make use of the detailed structure of 
matter. 

Historically, H played the role of the magnetic field vector instead 
of B. The reason for this is as follows. The mechanical momentum in an 


1 See Exercise 2. 
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external magnetic field B, due to the magnetic moment M of matter, is 
B x M=(yu,.H + M) x M=y,.H x M=HxM’, 


where we have introduced M’ = p)M. Thus the magnetic moment seems 
to be evaluated with H if we use M’ as the effective magnetic moment. 
Usually a third empirical relation is introduced, namely Ohm’s law, 


J=oE. (3.22) 


This relation is on a slightly different basis than the previous two empirical 
equations are, because we may solve Eqs. 3.11 with J given, without 
using Ohm’s law. 

With the phenomenological equations, it is then possible to solve the 
fundamental equations, Eq. 3.11. For example, the elimination of D and 
H gives 


(: = 75) = —V x(V x E) 
YO a ° 
in the special case p = 0, V-E = 0, 

@E oE 


uaz ton = AE ; 


exactly the same equations hold for B. 


Lorentz Electrodynamics 


We shall consider now the so-called “microscopic equations” for point 
charges in vacuum. No complications due to the atomic structure of 
matter are present. We assume that the same basic laws are valid micro- 
scopically as was the case macroscopically. Because no matter is present, 
we can identify D and E, B and H. The current J is due entirely to the 
motion of charged particles; hence J = pv(xt), where v(x?) is the velocity 
of the charges at point x at time ¢. The fundamental equations become 

B=-VxE; 
E+pv=V xB; (3.23) 
V-B=0; 
V-E=p, 
where the two functions, velocity and charge distribution are related by 
the equation of continuity 


p+V-(pv) =0. (3.24) 
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Actually these microscopic equations must be taken as the more funda- 
mental equations, The macroscopic equations, Eqs. 3.11, must be obtained 
from here by some statistical averaging process, using the structure of 
material media. This is indeed the case.* 

In particular, we see from Eq. 3.23 that for empty space p =0, E 
and B satisfy the wave equation 


O7E=0, O7B =0. (3.25) 


The system of equations, Eqs. 3.23, can be further simplified by the 
introduction of the potentials @ and A defined by 


ay 


1 
bev -— A BEV XA, (3.26) 
cot 
which already have been discussed briefly in Chapter II. The first and 
third equations in Eqs. 3.23 are satisfied automatically, and we are left 
with the following two equations: 
i od 16 
AA — V(V-A) = -25(v6+254) +Py; 
c ot c 
v-(vo+t2) = i 
corr 
However, the simplification obtained by the introduction of ¢@ and A 
must be paid for by the fact that @ and A are not unique for a given 
field (E, B). We shall discuss this point in detail later. 
Next, we shall discuss the covariant form of the fundamental equations 
in order to demonstrate the relativistic invariance of electrodynamics. 


Covariant Form of Maxwell-Lorentz Equations 


From the potentials ¢, A, we pass to the four-vector potentials 
A": (9, A); (3.28) 


then the defining equations, Eqs. 3.26, can be written in simple tensor 
form: 
eA A, (3.29) 
as has been discussed already in Chapter II, Eqs. 2.20 and 2.21. 
In addition, we make the physical postulate that the total charge Q, 
or equivalently the equation of continuity, is invariant under Lorentz 
transformations. Evidently the value of the total charge Q is the same in 


1 See Exercise 7. 
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every coordinate frame. In analogy to the four-momentum of a particle 
p” = mycu", we define now a four-current density j" by 

it 


dx 
i ea 3.30 
P= 0A (3.30) 


where po is a rest-charge density and is an invariant quantity, and 
dt 
— C— 
P= Po as 


is the actual charge density which transforms like energy (zero component 
of a four-vector). The equation of continuity can now be written as 


Oo 
j= = + V-(pv) =0, (3.31) 


and is simply the four-dimensional divergence of the current ‘vector. We 
also note that the current vector always has a constant length: 


ii, =P = pie’. (3.32) 


In terms of the F,, and the j" the first and the third equations of 
Eqs. 3.23 are indeed satisfied automatically, for any antisymmetric tensor 
F,, satisfies the identity 


as = EP yoip liven = 0, 


or 

Fyap + Foy + Foye = 0. (3.33) 

The remaining two equations, Eq. 3.27, can be written in the simple form 
1 

ES pt SYe, (3.34) 


which is the covariant form of the Maxwell equations. The last equation, , 
by the way, also contains the equation of continuity, for F“” ,, is auto- 
matically zero because F*” is antisymmetric. 


Gauge Transformations 


We have already mentioned that potentials do not determine the 
electromagnetic field uniquely. A transformation of potentials of the form 


el sigs ahi (3.35) 


where A(x) is an arbitrary scalar function, does not effect the field com- 
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ponents. The field calculated from A’, is the same as the one calculated 
from A,: : 
teed ey, — ee A= F,,,. 


Therefore, only those quantities invariant under the gauge transformations, 
Eq. 3.35, will have direct experimental significance. Such quantities or 
expressions will be called gauge invariant. 

On the other hand, we can make use of the freedom given by gauge 
transformations to simplify the field equations, Eqs. 3.34. If we choose 
A, such that 


OA" 
—— = 0, (Lorentz gauge) (3.36) 


then the field equations become simply 
; . 
(744 = a The (3.37) 


The choice of Lorentz gauge can always be made by adding to A, a term 
A, such that A,” = —A,™. Even with the choice of Lorentz gauge, the 
field is not determined uniquely: we can add to the potential A” the 
divergence f,, of any solution of the d’Alembert equation []?f = 0, with- 
out changing the field. 

In the second part of this book, we shall also use a different gauge: 


V-A=30, (radiation gauge). (3.36) 
Another useful choice of potential is 
o=0, V-A=0, (Coulomb gauge). (3.36’) 
Invariants 


From the vector A” or tensor F“” and their derivatives, many quantities 
invariant under Lorentz transformations can be constructed. Two im- 
portant invariants containing only the fields F*’, which therefore are 
also gauge invariant, are 


HY Fe 
F sa 


(3.38) 


uvip 
€ 2 ee 


or, in terms of the electric and magnetic fields, (E? — B?) and E-B. As a 
consequence, we have statements of the form that if E and B are orthogonal 
or have equal magnitudes in one frame of reference, they are orthogonal 
and have equal magnitudes in all frame of references. 
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Covariant Form of the Maxwell Equations in Material Media 


If we introduce two new antisymmetric tensors! 


0 -cD, -cD, -cD, 0 cB, cB, cB; 

Gl = 0 —H, H, Ke = 0 -E; E, ; 
0 -H, 0 -E, 
0 0 

(3.39) 


and a current density four-vector 
J* = (cp, J), (3.40) 


then the fundamental Eqs. 3.11, together with V-D = p, can be written 
in the form 


(3.41) 
KS 0: 
The two tensors are coupled by the phenomenological Eqs. 3.18 and 3.21, 
or 
GP EPSEe (3.42) 
where, in the case of homogeneous media, E*’”? is the complete anti- 
symmetric symbol with elements 


BE’? =¢e; y=1,2,3: 
Bite Wace k— 1,2, 3, 


If the material medium is not homogeneous, E“’’? is a more complicated 
tensor, ¢ and yu being 3 x 3 tensors themselves. From the transformation 
property of E“’’?, we can thus obtain the transformation properties of 
é;; and y;; under Lorentz transformations.? The phenomenological Ohm’s 
law gives another connection between G4” and K,,, namely, 


C= ee (3.43) 
where Sak is, for homogeneous media, the completely antisymmetric 
tensor, with )'1?3 =o, 

Again we can introduce potentials 
a":(a°,a); bY = (6°, b), 
such that 
Guy = Ayn — Any 5 K,, = by, = Day ’ 


1 For the corresponding forms of Guy and G", ---, see- Eq. 1.104. 
2 Exercise 6, 
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in order to simplify the field equations to the form 
Oa" = j*, 
Ob" = 0, 
by a suitable choice of potential. 


(3.44) 


Spinor Form of the Maxwell Equations*! 


The covariant form of the Maxwell equations in terms of the vector 
field A,(x) is the most natural one; however, it is not the only possible 
covariant form. We have seen in Chapter I that there are other ways to 
express the Lorentz invariance of a theory. In particular, it was emphasized 
that the spinor representations of the Lorentz group, in terms of two- 
component spinors, were more fundamental in the sense that vector and 
tensor representations can be built up by combinations of these spinors. 
Therefore it is interesting, and for some purposes useful, to have a spinor 
representation of the electromagnetic field. 

There are various ways of writing the Maxwell equations in spinor 
form. The most natural way is to use the correspondence between tensors 
and spinors discussed in Chapter I. According to this correspondence [see 
Eq. 1.60, Chapter I], a vector index is equivalent to two spinor indices, 
one dotted and the other undotted. For the electromagnetic field we have, 
therefore, 

Fe Fhe 
where 
Eis Lae? 
In addition, we know that F*” is a real antisymmetric tensor. 

This condition allows us to describe the electromagnetic field by a 
second-rank spinor, rather than fourth. For, if F“” is real and skew sym- 
metric, there exists a uniquely determined symmetric second-rank spinor 
¢* such that 

F2%bs — 4p C# ae C¥ Gh]. 
Note that on the right-hand side C is antisymmetric, so that F is split 
into a part symmetric in (a, 8), antisymmetric in (é, 6) and a-part sym- 
metric in (¢, §) and antisymmetric in (a, ). 

Because ¢,, is uniquely determined, the electromagnetic field can be 
described by a symmetric second-rank spinor. If we define derivatives 
with respect to spinor indices by the correspondence 

Cte, O, 


1 Sections marked with an asterisk may be omitted at first reading. 
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and the equivalent of the current vector by 
aa = oe 
the Maxwell equations, Eqs. 3.34 can be written in purely spinorial form: 
: digs, 
oO bap = — = BF (3.45) 
The Maxwell equations can also be rewritten in the form of neutrino 
equations. If we define a four-vector G, by 
G=E + iB, and Gy = 0, 
then we can write the Maxwell equations in the form 


6G, 1 


Bost = —— iy, (3.46) 


where B, are four 4 x 4 hermitian matrices obeying the relations 
BoB, = B,Bo; pels p= Opdy293% 
BBs = 55; — teijBy 5 i,j,k = 1,.2,.33 €55= 1. 


The explicit form of the matrices are given by 


- 0 
Bo =I x1; p= ( < 5) = 43 @ a5 
f0 I 0 «6 


where the o’s are Pauli matrices. Thus the B matrices are the Kroenecher 
products (@) of the Pauli matrices. 
By a similarity transformation on f’s of the form Bi, = S~"B,S, with 


ec Lat 
sat |-ior 7 0 
V2} 0 i oO -il? 
C0 


we can reduce the B matrices fully to 
Bo = Bo; B, = 03 @a;; £2 = —03@0,; 83 = -1,@o,, 


so that the four-component Eq. 3.46 splits into two two-component 
equations (see Exercise 11). Unlike the neutrino equation, these two 
equations are coupled through the subsidiary condition Gy = 0, which 
we introduced above. 
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Equation 3.46 has the general form of the wave equations for all spin 
values. The Dirac equation and various forms of the so-called Kemmer 
equations have the same form. 


3. Lagrangian Form of Field Equations 


We have seen that fields as physical systems will be described by a set 
of functions W*(x, t), «= 1, ---, N, satisfying-certain partial differential 
equations, the so-called field equations. (Instead of the initial conditions 
in ordinary differential equations of mechanics, we have here boundary 
conditions to ensure a unique solution.) For example, the field in the 
example given in Sect. | is described by a single function, and the electro- 
magnetic field in vacuum must be described either by six functions, three 
components of E and three components of B, or by four functions A” 
which, however, are not unique, or by the spinor functions @,,. In this 
section we shall discuss quite generally field equations for arbitrary fields 
from the standpoint of the action principle, which also guarantees auto- 
matically the relativistic invariance of the theory. 

Let w(x) be a set of real or complex field quantities that may be scalar, 
vector, spinor, or tensor functions of the four-vector x. The Lagrangian 
density, which we shall denote from now on by L and refer to simply as 
Lagrangian, 


L = LW*(x), W(x), x"), (3.47) 


is assumed to be a scalar and to depend, for the time being, on the first 
derivatives of y*(x) only. 
We consider the variational principle 


| Ldx=0, dx =d*x=dx°dx'dx*dx*, (3.48) 
. : 


where R is a four-dimensional space-time region with a three-dimensional 
boundary B. We shall specify precisely the variation 6, and we shall 
consider several different variations. 

(A) Variations with a Fixed Boundary 5 


In this case, clearly, the variational principle becomes 


[ Lax= | stae=o. (3.49) 
R R 
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We can then write, from _ 3.47, the variation of L explicitly 
oer a4 ae id 
i oy? + ie. 
The coordinates x” are not varied; dL is due entirely to the variation of 
the field quantities. We can rewrite 6L in the form 


OL OL OL 
6 oy* + oy" }. 3.50 
a= (5~ age) lie ‘) a 
We shall often use the following short notations 
~ OL oL 
=— Le —, A, =L,-0,Le. 3.50’ 
L, oy? 2 (3 F;) Hu ( ) 


Then 
| OL dx -| A,6W* dx +{ OLE Sy") dx ; 
R R R 


the last divergence term can be transformed by Gauss’s theorem (see 
Chapter I, Sect. 8) into a 3-dimensional surface integral: 


| 5L dx = { AoW dx + [ do, LE SW. (3.51) 
R R B 


Now, we further specify the variation 6 in such a way that dy* vanishes 
on the boundary B of R, but is completely arbitrary elsewhere. The last 
term in Eq. 3.51 then vanishes. Because 6” are arbitrary in R, the action 
principle, Eq. 3.49, therefore implies the equations 
OL OL 

ie aye v - °( a5) = (3.52) 
These equations will be taken as field equations. To summarize, we have 
shown the following result: Under a variation that does not change the 
region of integration, nor the coordinates, and such that the variation of 
the field quantities y* vanish on the boundary of the integration region, 
the action principle, Eq. 3.49, gives the field equations, Eqs. 3.52. Physic- 
ally, the actual field quantities make the action integral J L dx an extremum 
under the above variation. It is the philosophy of the Lagrangian field 
theory that this principle is valid for all physical systems. Therefore, the 
dynamics of the system is specified once the Lagrangian is given. Of course, 
the variational principle, does not tell us how to choose the Lagrangian. 
The choice of the proper Lagrangian is the more physical aspect of the 
theory. The mathematical part of the theory is to obtain and solve the 
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field equations. Often one is not interested in the complete solutions of the 
partial differential equations. Again the physics will tell us what type of 
information is meaningful and how to obtain it. The situation here is 
quite analogous to other branches of physics. In mechanics the specific 
form of the forces, or the Hamiltonian, must be given in order to solve 
the equations of motion. In practice one knows the Lagrangian or the 
Hamiltonian of simple systems. One constructs complicated ones from 
their combinations or generalizations, as the case may be. 

It is important to note here that the specification of a field by a 
Lagrangian is not unique. The Lagrangian determines the field equations 
up to an additive divergence term. For, if we add to L a term 0,T"(y, x), 
we get 


| +apyac=5 Lax+o[ do,0* = 5 | L ds, (3.53) 
R R B 


because (x) is fixed on the boundary B under our variation. Notice that 
T* does not depend on the derivatives of w(x). Therefore, we have the 
result that the set of Lagrangians {ZL + 0,1}, with arbitrary I", refers 
to thé same physical system. 

If the field quantities w(x) are complex, we can take either their real 
and imaginary parts separately as independent field variables, or, alter- 
natively, y*(x) and w**(x) as independent quantities. In the latter case we 
will get similar field equations, Eqs. 3.52, for the complex conjugate y**(x). 

Examples. Table I shows examples of Lagrangians and the field equa- 
tions derived from them, according to Eq. 3.52. In all these examples the 
field quantities y*(x) are ordinary real or complex numbers (c-numbers), 
although some of the fields mentioned (Dirac, Klein-Gordon fields) have 
quantum-mechanical importance. 

Several forms of L are listed for the free electromagnetic field. These 
either differ from each other by a divergence term according to the above 
discussion, or use F,,, or A,,, or both, as the field quantities that are varied. 
In L,, A, are taken to be the field variables and not F,,, although L, is 
written in terms of F,, to show its gauge invariant character. We have, 
then, 

OL, OL 


——=), =F 
oA" aa" 


. 
HY? 


hence Eq. 3.52 gives F*”,, = 0, or (_]’A, =0, if we choose the subsidiary 
conditions 


on the potentials. 
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Ly is not gauge-invariant (Fermi-Lagrangian), but it gives the field 
equations [_]?A, = 0 without subsidiary conditions. L,,, differs from Li 
by a divergence term 4 0,(A,A"”) if 4 =0. Finally in L,, we consider 
both A, and F*” as independent field quantities (Schwinger-Lagrangian). 
The variation of L,y with respect to both of these quantities gives 


Ane 0; Fi’ =0, 
and the connection between F,, and A,, namely . 
Bua= Ay, — Agy: 


For the Dirac field a 4-component complex field quantity (x), 
oe = 123045 and 
W(x) = W(x)? 


are taken as the 8 independent field variables to be varied. Here the y" 
are four 4 x 4 matrices satisfying 


int it le 
The two Lagrangians for the Dirac field differ by a divergence term 
4 10, Wy"). 


The last example, the interaction of Dirac and Maxwell fields is dis- 
cussed in the second part of the book. 


(B) Variations Involving a Change of Boundary B 


To distinguish from the previous case, we denote this variation by 6’. 
This variation involves a variation of the field quantities 


W(x) = W(x) + W(x), 
and a variation of the coordinates 
ee OX. 


Let R’ be the new varied region of integration. Then 


x [rax=[ ocax+| Lax={ sax | 1 do,bx" (3.54) 
R R R'-R R B 


This is illustrated in Fig. 4. The first term on the right-hand side has been 
evaluated already in Eq. 3.51. Thus, 


6’ { Ldx= { A, OW" dx + { do ,LLZow* + Léx"]. (3.55) 
R R B 
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Fig. 4. Variation of the region of integration in the Action Principle 
| L dx. The integral over the shaded region is equal to | do,dxel 
R 


Assuming now that the field equations are satisfied, i.e., A, =0, we 
get 


5’ | Tighe | do,[L#5y* + Léx"]. (3.56) 
R B 


This equation relates the 5’ variation of the action to a definite boundary 
integral. We now re-express dy* in terms of variation of fields at the 
corresponding points under variation; i.e., 
ti / , a , ow 
OY = OY AT WC + Ox ee aia oxel 

ow compares the two fields at the same space-time point, whereas 6’ 
compares them at the point x and the varied point x’, respectively. In 
terms of 6’~, Eq. 3.56 becomes 


| Lax | do,[Li5'W* + (giL ~ W,L)5x"] =0. (3.57) 
R B 


Therefore, the new variation principle with the variation of the bound- 
ary gives the equation, after transforming Eq. 3.57 into a volume integral, 
6,[L20'p* + (ghL — WL) dx"] = 0. (3.57’) 

In the next section we use Eqs. 3.56 and 3.57 to discuss the conservation 


laws. Equation 3.57’ is used for an alternate derivation of the symmetric 
energy momentum tensor (see the solution of Exercise 10). 
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4. Conservation Laws 


The principle of relativity has played a central réle in our development 
so far. According to this principle, we are describing permanencies of 
nature independent of any coordinate frame. Closely related with this fact 
is the existence of certain quantities whose values do not change during 
the evolution of the system, the so-called constants of motion, These 
quantities are easy to observe or measure. Important physical concepts, 
such as energy, momentum, angular momentum, etc., have been intro- 
duced, because these quantities are constants of motion under special 
conditions, or for simple systems. A complicated physical system may be 
characterized by its constants of motion rather than by the detailed behavior 
of its parts. The importance of such global quantities is best known in 
thermodynamics. 

In addition to the principle of relativity, we may have additional 
internal symmetry properties for physical systems. The laws or the equa- 
tions of the system are then invariant under certain transformations 
corresponding to this symmetry. In turn these transformations give rise 
to the existence of conservation laws. 

In this section we give a general discussion of the connection between 
the invariance and symmetry, on the one hand, and the conservation laws 
or constants of motion on the other, based on the action principle. We 
obtain first the integral expressions for the global quantities which are 
constants of motion, and then the equations of continuity which show what 
happens at each point, and finally their connection. 


(A) Conservation Laws in Integral Form 


In Eq. 3.56 we have calculated the variation of the action under a 
variation or a transformation of x“ and w*(x) simultaneously. Let us take 
the region R in this formula to be specified as follows. It is surrounded 
by two space-like surfaces 0, and o, extending to infinity. (Fig. 5.) Also 
we make the important but plausible assumption that the fields y“(x), 
their derivatives, L and L%, --- , all tend to zero at spatial infinity. Clearly, 
only then we can expect to find globally conserved quantities. We can 
then assume that B consists of a, and a, alone; the contribution of the 
surface parts at infinity can be neglected. Equation 3.56 becomes 


a ax -| do,{ Lox" + Leow] =| do,[Léx" + Lisp") 
R B a1 
= G(e,) — G(o,), (3.58) 
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where we have defined the generator of the variation 6’ as 
OL 
G(a) = | do,( Lox" + au). (3.59) 
« OW, 


Thus G is given by an integral over a three-dimensional surface; in par- 
ticular, if we choose do, along the time-direction, 


G(t) = e| a?x(Lot + = sy). (3.59’) 


Fig. 5. Space-like surfaces o; and oz, the four-dimensional volume R 
and the three-dimensional boundary B 


Now let us assume that our system has an invariance or a symmetry 
property under the variation (or transformation) expressed by the fact 
that the variation of the action vanishes; i.e., 


a| Ldx =0. (3.60) 
R 


Then.it follows from Eq. 3.58 that G(c) does not depend on the space-like 
surface o, nor on time; thus the generator G(c) of the variation is a 
constant of motion: 


G = constant. (3.61) 


To summarize, if a variation x’ = x + dx, '(x) = W(x) + d(x), does not 
change the action Jp Ldx, then G given by Eq. 3.59 is a constant of the 
motion, and is the same for all coordinate frames. 
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If G is constant and invariant, it follows from Gauss’s theorem! that 
the integrand of G is divergenceless: 
“ OL 
6, (sx hi ov‘) =10) (3.62) 
OW 
This is a generalized equation of continuity. We have already seen a 
special example of it in electrodynamics, Eq. 3.31. There is always a 
one-to-one correspondence between a constant integral G and a local 
“equation of continuity”. We shall discuss examples of this correspondence 


a little later. 
It should be remarked that, although a divergence term dT *(W, x) 


added to the Lagrangian L does not change the field equations, it does 
change the value of the constant of motion to 


G'=G +| da, 0,T%(W, x) dx". 
B 


Finally, we should like to note here, why f Ldx is called the action 
rather than Hamilton’s principal function, as in mechanics. Returning to 
Eq. 3.57, we see that for a variation that leaves | L dx invariant and for 
which 6’ = w(x’) — W(x) = 0, we get 


| do,(OFL — °,Lt) dx” = 0, 
B 
or 
0, T te A —Og + WL?) =10. (3.63) 


In analogy to classical mechanics T #', defined by this equation, may be 
called the canonical energy-momentum tensor. This is exactly. the procedure 
used in classical mechanics to obtain the energy conservation from the 
action,” which justifies denoting jLdx as the action quantity of our 
system. The energy-momentum tensor will be discussed in detail presently. 

Now we discuss the important special cases of our variation 6’ and 
the corresponding conservation laws. 


1. Coordinate Transformations. Let 5’ consist entirely of coordinate 
transformations, Lorentz transformations or more general coordinate 
transformations. A physical point P is labeled in two coordinate frames 
by x" and x’* = x* + dx", respectively. This transformation induces a 
change on w(x), although the field variables themselves are not changed. 


1 See Chapter I, Sect. 8. 2 See, for example, H. Goldstein, op. cit., p. 228. 
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At the same physical point P the field quantities in two coordinate frames 
are labeled by w*(x) and w’%(x’), respectively. y’%(x’) is given according 
to the nature of the fields, ie.; whether they are scalar, vector, spinor, 
tensor, etc. Let us write quite generally 
w's(x') = [55 + Ag(L)1W*(x), (3.64) 

Ag = 9, for a scalar field ; 

(1+A)j,=L%, for a vector field ; 

(1 + A) = S(L), for a spinor field, etc., 


and L% is the Lorentz transformation (or possibly a more general co- 
ordinate transformation). On the other hand, 


oy’* 


ox" 


w'a(x’) = '(x + dx) = p(x) + 


ox". (3.65) 


Combining Eqs. 3.64 and 3.65, we get the induced variation on y: 
Spt = W'4(x) — W(x) = Agwh(x) — Y5,x". (3.65') 


Therefore, by Eq. 3.59, the generator of the coordinate transformations 
is given by 


G(a) -| do, [Léx" + L(Agy? —~ y*,dx")]. (3.66) 
To further evaluate G, we now consider an infinitesimal Lorentz trans- 
formation 


x'* = x? + wx” + a", 


or 
Ox" aa") x” 4a", (3.67) 

where a" is a constant vector, and w"” = —w” isa constant antisymmetric 
tensor.’ The quantities A§ in Eq. 3.64 depend only on ®,,. If we write 

Ag = 459" Ouvs (3.68) 
then 

Si’ = —S5", (3.69) 
and 

Sz” = 0, if w is a scalar field, 


= 99s —9° OB: if y is a vector field, 
= Hy"y’ — y’y')§, if w is a 4-component spinor, etc.” 


1 See Chapter I, Sect. 4. 2 See Exercise 12. 
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We now rewrite the middle term in Eq. 3.66 in the form 

LiAgW? = 4LES9°°o,W* = f""w,,. (3.70) 
Now, from Eq. 3.67, 


55 Ox.) = 04, = — (6x); 
hence 
Sf?" o,, = — ns (f*""6x,) + ee 6 hail 
Ox Ox 
The first term, when integrated over the space-like surface a, gives 
| do, a (f"#°5x,) = 4 § do,[(f"5x,),« + (f"7"5x,),0] 


=4 do,(f"5x,)¢ — 4) do f"”"5x,) > (3.71) 


where in the second term we have interchanged the dummy indices p and o 
and then used the antisymmetry of f*’? in the indices yp and o, which will 
be justified below. The contribution of this term to G is a constant inde- 
pendent of whether or not we have Lorentz invariance. For if we evaluate 
Eq. 3.71 between two space-like surfaces and use Gauss’s theorem, 


( | -| )ao,7*"8x,). — da f?"5x,) 1 =| dx[(f"""5x,) on 
a2 a} ‘ V 


— (f"""5x,) ne] = 9. 


Therefore we can omit this constant number and obtain 
G(s) = | do, [Léx* + f¥?? ,6x, — Law*%,ox"] = -{ do,T*’5x,, (3.72) 


where we have defined the so-called energy-momentum tensor T*’ of the 
fields, 


TY =Liy®” — \ oe — Lg” (3.73) 


Before we discuss the meaning of 7*”, we must prove that f”” is anti- 
symmetric in p and o. From its defining equation, Eq. 3.70, we obtain 


foe? = HLESH? + LiSy* — Lesy* ; (3.74) 
indeed, if we multiply both sides of this equation with w,, and sum, we get 


SP Way = YLESG Wop + LiSG" Oey — LS a,,0". 
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If we change the dummy indices p and a in the last term and make use 
of the fact that both w,, and S”’ are antisymmetric, we see that the last 
two terms cancel and we get back the correct defining equation, Eq. 3.70. 
From Eq. 3.74 and the antisymmetry of S”’ we finally obtain the desired 
result: 


fi? = fr, (3.75) 


We shall discuss now the physical significance of the energy-momentum 
tensor 7"”. Let us use explicitly the fact that we are considering an in- 
finitesimal Lorentz transformation, and that 5x“ is given by Eq. 3.67. 
Hence the generator G(c) becomes 


G(o) = — { do,T*"(@,,X" + ay). 
Or, if we introduce the quantities 


BP. =| deg” s (3.76) 


t= | do,M*”” =| do, Tx? Fx"), (3.77) 


G(c) can be written in the form 
G(c) = —a,P” — 4o,,J”. (3.78) 


The quantity P” is the generator corresponding to translations and 
J” that corresponding to “rotations”. If for the system of fields we are 
considering, the Lagrangian is invariant under the inhomogeneous Lorentz 
transformations, these generators will be constants of motion. They must, 
therefore, be interpreted as the total energy and angular momentum of 
the system, respectively. 

Because L is invariant, the derivatives Lt = oL/oy*, have definite trans- 
formation properties. We see from its defining equation, Eq. 3.73, that 
T”’ is indeed a tensor. Therefore, P“ is a four-vector and J#” is a tensor. 

If G(e) is a constant of motion, we obtain from Gauss’s theorem and 
the vanishing of T“” at spatial infinity the equations of continuity 


T= 9, (3.79) 


Mh, = 9. (3.80) 
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These equations in turn imply that 7“ is symmetric, for 
= BY ee SS — Tee ee PHeGy 
hence! 
Petals: (3.81) 


We see that the variation introduced at the beginning of this section 
automatically yields a symmetric energy-momentum tensor. Notice that 
the previously introduced, so-called “canonical energy-momentum” 
tensor, Eq. 3.63, is not symmetric. We have the relation 


1 ae eee | anne (3.82) 
The two tensors become identical for scalar fields; their difference is due 
to the fact that in calculating T’” we have taken into account the trans- 
formation properties of the underlying fields. 

The infinitesimal Lorentz group has ten parameters. The corresponding 
ten generators P“ and J*” are identical with those of the Lorentz group 
discussed in Chapter I, Sect. 5. To evaluate P* explicitly, one can con- 
veniently choose a frame such that the surface t = constant is the space- 
like surface. Then 


Jes =| Be ax: 
A 
in particular, the total energy is the space-integral of T°°. In the par- 


ticular coordinate system considered, the equation of continuity T”” , = 0 
implies 


{ T*? d3x = -| 7 dex k= 1, 2, 3, 
ax Jy Vv 


--| Rena s,, 
Ss 


where V is a three-dimensional volume and S is its surface. The last 
integral vanishes if T“* vanishes at spatial infinity. Hence [y7"°d°x = P” 
is both a constant and a Lorentz invariant. 


2. Change of Field Variables. Here we consider an intrinsic variation 
of the field quantities so that no coordinate transformation is involved: 


6x*=0; W(x) = W(X) + Soy. 


1 We see then from Eq. 3.73 that 
Legge Lgpee = foe f°". (3.81’) 
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From Eq. 3.59 the generator corresponding to this variation is given by 
G(s) = | do, LtSo" = | do,j", (3.83) 


where we have introduced a certain four-vector j* corresponding to the 
variation considered: 


ji = er. (3.84) 
Now if this particular variation of y leaves {, Ldx invariant, then G(o) 


given by Eq. 3.83 is again a constant of motion, and assuming that the 
fields fall off to zero at spatial infinity, we get the equation of continuity 


jt, =0. (3.85) 


We shall see that the gauge transformations and the corresponding 
continuity equation for the current belong to this category. The generator 
in Eq. 3.83 may be interpreted as the total generalized charge of the system. 


3. Coordinate Transformations and Intrinsic Variations. In the general 
case of a simultaneous coordinate transformation plus an intrinsic change 
‘of the field variables, dy in the fundamental equation, Eq. 3.59, consists of 
two parts, one comes from coordinate transformations (induced dy-varia- 
tion) and is given by Eq. 3.65’, and the other is the intrinsic variation 
do. Hence 

Ow = AGW? — pbx" + dow, 


and the generator is given by 
G(a) = | do,[Ldx" + Li(Agw? — W2 dx" + So)] 
= -| do,[ T*’dx, — L*do*], (3.86) 


which is the sum of Eq. 3.83 and the generalized charge. 


(B) Conservation Laws in Differential Form 


In this section we shall derive the conservation laws in differential 
form directly from the Lagrangian without using the action integral. We 
go back to Eq. 3.50 and assume that the field equations are satisfied. 
Then the variation of L is given simply by 


5L= 0, (L*5p/*). (3.87) 
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This equation connects 6L with an arbitrary variation dw consistent with 
the equations of motion; i.e., variations which leave the equation of 
motion invariant. 

If, in addition to the equation of motion, the Lagrangian itself is 
invariant; i.e., dL = 0, we get immediately the equation of continuity 


a, (L#5W%) = 0. (3.88) 


For an intrinsic variation dg, this is precisely Eq. 3.85 with j* given by 
Eq. 3.84. 

Now let us consider the infinitesimal inhomogeneous Lorentz trans- 
formations, Eq. 3.67, and evaluate both sides of Eq. 3.87 under these 
transformations. We have 


L yb, 
6L=L *«)—L(Xx) = Ss bx" = — X(orx, +a") 


ri 
=— os a 0 (x, — Xx =). 
The variation of dy under Eq. 3.67 has been evaluated already in Eq. 3.65’: 
5W* = W(x) — WX) = ESO W? — Yi dx" 

= Via" + do SH — OW” — x). 
If we insert these expressions into Eq. 3.87, we obtain 
Fn a" — i0(x, = — Xy =) + 6,{LowSa’ 

= WhorreL S9Up — (x'W" — x°Y"NT} = 0, 
or, using Eq. 3.70, 
a’a,(— Lé# + L4y%) + 400, —58x,L + Sex, L — fe? + fier 
+ LU(xty** — x4y*")] = 0, 

or 

aT” , + 40M"? , = 0, (3.89) 
where T is the canonical, unsymmetric energy-momentum tensor 


° 


yer Leo — Ly", (3.90) 
and we have introduced 


M#? = ‘fice — fee 4. (Tx? me T"?x?), (3.91) 
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Because a, and w’? are arbitrary, the equation of continuity follows from 
Eq. 3.89: 

T”,=0, and MM =, (3.92) 


Therefore, this method leads first to the unsymmetrical T”” rather than 
to T*’, as in the previous case. By Eq. 3.82 the difference between 7“” 
and T*” is a divergence, T*” = T*” — f**",; hence Eq. 3.92 implies that 


T™ ,=0, 


and vice versa, because f"’” is antisymmetric in the first two indices. 
Furthermore, the energy-momentum four-vectors evaluated from T“” or 
oO 

T”’ differ by a fixed constant, because 


o2 
| PB) hae ail axfl" = 0: 
o1 R 


We also show that, if Eq. 3.92 holds, T*” so defined must be symmetric. 
From M*’? , =0 we get 
TH = TT =e — fv * 


on the other hand, 


TH = TH = Te fake Te — fo 7 — fm ps 
Combining the last two equations, we see that T”” = 7”. Q. E. D. 
Conversely, these results may be used to symmetrize the canonical 
energy-momentum tensor and to find the explicit form of f#”? that should 
coincide with our previous derivation.! Thus, starting from T*” we can 
re-establish the symmetry of the energy-momentum tensor. Let 
TY THe = cer F. 


We want to show that C” = f*"”, if Thas to be symmetric and also satisfy 
an equation of continuity. The latter requirement implies that 


 Ohied = —crey, 
and the former implies that 


TH _.7T" = Oe = Gg) 
The left-hand side of this equation is known from Eq. 3.90. Let 
TH — T= Liew _ Layee = Bee 


1 F, Belinfante, Physica, 6 (1939), 887. 
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Hence it is sufficient to require that 
BH = CoH _ Con. 
A solution of this equation satisfying C°*” = —C"” is 
Co = 4B — Bro 4 BH) _ Om, 


Then, from Egs. 3.74 and 3.81’, the desired results follow. 

We see that we can formally symmetrize the canonical energy-momen- 
tum tensor by adding to it a suitable divergence term. However, the direct 
derivation of a symmetric energy-momentum tensor T“” from general 
principles is the more correct and fundamental approach. Another deri- 
vation of the symmetric T”” from the variational principle involving a 
change of the boundary is treated in Sect. 6, Exercise 10. 

Returning to our starting point, Eq. 3.50, we see that for variations 
that leave the form of the Lagrangian invariant, 6L = 0, but not the field 
equations, one obtains a generalization of Eq. 3.88, 


A,5W* + 0,(LESW%) = 0. 


(C) Lagrangians Not Invariant Under Translations 


The Lagrangian density L is always assumed to be a scalar with respect 
to homogeneous Lorentz transformations. But L may not be invariant 
under translations. An example, which we shall consider later in detail, 
is the case of an electromagnetic field interacting with a given external 
current. 

In this case L has an explicit dependence on x“, and we get an addi- 
tional term in Eq. 3.50. It becomes 

OL 


dL = A," + 0,(L45y%) + — 


ee ee (3.93) 


exp! 


the last term is due to the explicit dependence of L on x*. Consequently, 
for inhomogeneous Lorentz transformations, we obtain 


(3.94) 


but 
M+? = 0. 


Clearly the energy and the momentum of the system is not conserved, 
but the total angular momentum still is. 
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(D) Explicit Form of Conserved Quantities 


In Table II we have evaluated the energy-momentum tensor 7” and 
energy-momentum four-vector P’ for a number of simple fields considered 
in this book, using the general theory of the present section. (See also 
Table I for the corresponding Lagrangians and the equations of motion.) 
The evaluation of J*” from T*” is left to the reader. (See Exercise 13.) 

Some of these fields also admit an intrinsic variation of w, such as 
gauge transformations. In this case we also give the corresponding 
generator G and the (current) density /*. 

For the scalar field the spin term f“”" is zero, and we find 


T= TH = orp” noms ae 8 a gu” s 49""(¢"o., = m*?). 
The corresponding value of the energy-momentum vector P“ in a par- 
ticular frame is given in Table II (P* = { d°xT). 
For the complex scalar field we find from the Lagrangian given in 
Table I that 
T” = T*(g,) + T*'(O2), 
where ¢, and @¢, are the hermitian fields defined by 
1 
V2 
Equation 3.95 can also be written in terms of ¢ and ¢*: 
TY = pr af gup*” _ gL. 
The complex scalar field admits also a gauge transformation. The 
Lagrangian L(#,) + L(¢2) is invariant under the transformation 


‘ (*) 
+ Ser |, 
i = We 


where T is a 2 x 2 matrix. Because ¢, and ¢, are real, T must be anti- 
symmetric and imaginary: 


1 * me _ At 
P= Boro”, $2 =—=(¢ — 9”). 


This then corresponds to the following transformation of the complex 
field: 


o> > + ied ; 
¢* = ~~ of ied*, 
as can also be verified directly by the invariance of the Lagrangian written 
in terms of ¢ and ¢*. 
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The generator of the transformation, Eq. 3.97’, is given by 


G= | do, Ltoy* = | do.J” 
and we find 
j* = e(oh* — o6*"). 
The invariance of L implies that 
J’, = 0. 
Physically j* is the current density and G the total charge of the Klein- 
Gordon particles. Therefore the one-component real scalar field describes 


only neutral particles. 
The free electromagnetic field with the Lagrangian 


L= Jers 
gives 0L/0A*” = F*’ and, according to Eq. 3.90, 
TM = — FA," + ig FOF,,, 
which is neither symmetric nor gauge invariant. We now symmetrize this 


canonical energy-momentum tensor according to the general principles 
discussed in the previous section. We obtain 


pe = ee AC 
and 
ead = AY GF? ; 
consequently, 
TH = FA,” — AY F! — g!T, = FURY givy 


which is obviously gauge invariant and symmetric. 
It is instructive to write out T”” in terms of the energy density H, the 
Poynting vector 


S = c(E x B), 
and momentum density 
1 
G=—<S 
(mse 
in the form 
1 1 
H =S, -S, =S 
¢ 6" eee 
cG, 
TE = ; : ; 
eG yy 2) 
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The equations T”” , = 0 take then the more familiar forms 


OH 
ev = 0 5 
Ot 
1a 0 ik : 
err staat = (2 i) =3 IL, BAS Be 


and express the continuity equation for energy and momentum. 

The free electromagnetic field also admits a gauge transformation for 
which dy* = A’*. The corresponding constant of motion is given, according 
to our general theory, by 


G= | do,F°"A,,, 


which corresponds to the continuity equation: 


0,(F?#A,,) = 0. 


5. Canonical Form of Field Equations 


In this section we discuss the Hamiltonian form of the equation of 
motion for fields, generalizing the corresponding discussion for particles. 
(Chapter II, Sect. 3.) The distinction is that instead of the one independent 
variable s, the proper time, we have four independent variables x’. The 
Lagrangian density is a function of the field variables y* and their deriva- 
tives: L= L(y", y%,, x). We define the conjugate field variables (or 
momenta) by 

TI? = oL/oy’, = Lt, (3.96) 


— 


and introduce a scalar function 
C = Tihs, — L, (3.97) 


or a tensor which is the canonical energy-momentum tensor already 
introduced: 


im = Ty — gL 6.98) 


Provided that we can solve in Eq. 3.96 y* ,, as a function of II, y and x 
we can express C or T”” as a function of * and IT?: 


C=C, zy). 
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If we now evaluate the derivatives of C with respect to w, (or IT4) by fixed 
x and If (or w,), we find that 


aan 
amg’ 
i =- ae = —Tt,, (3.99) 


where in the last step we have made use of the field equations, Eqs. 3.52. 
Similarly, with T*’: 


OE ee 

a# dae 
oT” aL 
—— = gh ee pe 3.100 
ay" g oy? g ao ( ) 


Equations 3.99 and 3.100 are the formal covariant Hamilton equations 
of field theory. The success of the method lies—we repeat—in the possi- 
bility that one can solve y/%, in terms of IT# in Eq. 3.96. Table III shows 
some examples where this method works. The Lagrangians are given in 
Table I. In the case of the electromagnetic field, we must use the Lagran- 
gian L,,,;. For the others one cannot solve A‘, in terms of If and A,. 
The table gives C and T“’. The first set of the canonical equations are 
identically satisfied. The second set gives the equations of motion as 
shown in Table I. 

The covariant quantities C and T*” used as “Hamiltonians” above do 
not have the meaning of energy. Similarly, in Chapter 2, Sect. 3, we have’ 
first introduced covariant “Hamiltonians” and then showed that certain 
energy expressions can be used as bona fide Hamiltonians. In field theory, 
also, the standard and useful practice is to use noncovariant canonical 
equations with the energy density being equal to Hamiltonian density #, 


= T (3.101) 


The canonical equations are now given, remembering that 


ee = TT, oy”? am L, 


by 
Ce 
ame © =a, (3.102) 
and 
0x éL 
(3.102’) 


ut ly 
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If we use the field equations, Eqs. 3.52, the second set of canonical 
equations can be written as 


ai ieee ou 8 bs 5) 
ay? xt 
or 
: OH 6 
0 ee Sh), kee 3.103 
NB es oy" Ox* ( e) ( ) 


. Equations 3.102 and 3.103 have the usual form of canonical equations, 
except for the additional term in Eq. 3.103. The field variables W* corre- 
spond to “coordinates” and the zero components of the conjugate 
momenta, IT°, correspond to “momenta” of the system. 

We also find that if the Lagrangian density L does not depend explicitly 
on time, the Hamiltonian density is constant in time. For 


dH @# 10: ee eo. OEE OL =2,0 OL A 
ae = Ot. = IT, oy** + Lee = oy v oy? ? Oy * a 


and the right-hand side vanishes by virtue of field equations. 

It should be remarked that, although the canonical Eqs. 3.101 and 
3.103 are not covariant, they are perfectly relativistically invariant. These 
equations can be written down in every frame. The Hamiltonian density 
has a definite transformation property under Lorentz transformation; 
namely it transforms as the 00-component of a tensor. (See also the dis- 
cussion in Chapter I, Sect. 6.) 

The Hamiltonian itself is given by 


H = [axe = a d?xT° — po (3.104) 


6. Lagrangians Involving Higher-Order Derivatives) 


So far we have considered Lagrangians which were functions of field 
quantities and their first derivatives only. There are also physical examples 
where higher-order field equations, and consequently Lagrangians de- 
pending on the higher derivatives of field variables, occur. Again the field 
quantities may be any spinor or tensor fields. 


1 For the material in this section we follow A. O. Barut and G. Mullen, Ann. Phys.. 
(N.Y.), 20 (1962), 184 and 203. ’ 
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We consider in this section the action principle 


OW, =6 | Lata, 4G, °° jidx, (3.105) 
a2 


where the subscripts pv mean i derivatives of field variables yw, 
We = OW7/Ox", Wainy = GW"/dx"dx"Ox* --- (n times) --- etc. Let N be the 
highest derivative occurring in L. We introduce the intrinsic variations of 
field quantities and their derivatives which do not involve any coordinate 
transformations: doW*(x) = y'*(x) — w*(x) and similarly doWi, doWiin) 
such that 
SW in) = OumyOoW"*, (3.106) 
where 
0,(n) = 0,0,0, *** (n times). (3.107) 


The intrinsic variation of the me density is given by 


Sok = Lae = bie (3.108) 


Let us introduce the symmetric function 

OL 
Wrew : 
all permutations 
of indices p(n) 


Le) = J > 


a (3.109) 


Then 
LE 8 So® = (— 1)" Buen SoY* + O,0LE Oyen 150" 
> o* ieee + + (- ne Oren LAM Soy. 
_ (3.110) 


With this, 5,Z can now be written as follows: 
N N-1 
OoL = + (- 1)” Dyn ee So" + a,| AF re) 2m bo¥"| 5 (3.111) 
n=0 n=0 


where we have defined 
N-(n+1) 
THO (1) Orgy he. (3.112) 
m=0 
Here pv(n) denotes a single index p and a set of n indices v. Now, as usual, 
we separate the action principle into a part involving an intrinsic variation 
of L due to the variation of field components and a part involving the 
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change of the region of integration due to the displacement 5x": 


| Ook dx + (| -| ) do, 6x"L = 0. 
o2 o1 2: 


If we insert d)Z into this equation and make use of Gauss’s theorem 
(see Sect. 3), we obtain the Euler-Lagrange equations 


N 
Del — 10 Bayh 0, (3.113) 
n=0 


or, written out explicitly, 
oL 7) (=)+ a? (=) i BG 
dy* dx" ape) © axtdx” \ayr, as 
We also obtain the generator for infinitesimal transformations 
N-1 
F(c) -| do, [ue + >) Wee anbo¥'], (3.114) 
a n=0 


so that 
OW, 2 = F(o;) bd F(o,). (3.115) 


It can be verified easily that these formulae reduce to the ones discussed 
in Sect. 3 for the case where L depends on the first derivatives only. 

We shall now develop a canonical formalism and obtain expressions 
for energy, momentum, charge, etc. For this purpose, we note first that, 
given dow", the variations 590, ()/* cannot be arbitrary. We must express 
the generator F(o) in terms of the independent variations of the field 
quantities. Now the variations 590,(,/" that are normal to the space- 
like surface are arbitrary. Therefore, we separate the derivatives 0, into a 
normal directional derivative d and a tangential derivative é, as follows: 
d=n,0", @=0,—n,0, (3.116) 
where n" is the unit time-like vector, nn, = 1. We have then 

nd, =n"d, — n'n,d =0; 


i.e., n” is perpendicular to d,; in a special frame n“: (1000) 6 = Oo. We also 
put ; 
0 = 06+--(n times), Nun) = N,Ny* + *(n times), 


We restrict ourselves to surfaces such that the variations approach 
zero sufficiently rapidly, at spatial infinity, so that 


f do, 5 hal — OF 


this integral can be transformed into a surface integral at infinity. (See 
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footnote at the beginning of this section.) We can then write the second 
term of F(c) as follows: 


N-1 N-1 
do, ” | a Ayn" sy | do, oe We, 5 i 1,0 )inyoW", 


and, after some calculation, 


N-1 
=| do, F156), 
kK=0 
where we have introduced 


N=1 
(= eid, Tr, (3.118) 


with 


tel n! 
m} (n—m)!mY 


Then the generator becomes 


N-1 
F(c) = | ac, | y T1#5,(6y*) — Léx ii (3.119) 
a k=0 
We now introduce the total variation dy*, defined by 
OY" = FoW* + W(x) — W(x) = So" + Wilx)dx" — de"SH W(x). (3.120) 
For higher derivatives we obtain 

SYieny = SoWeeny + WiamOX” — Se*° SWE ny » (3.121) 

When this expression is multiplied by 71,;,), we get 

5G MY) = 54) + 0,10) Sx” — e** SAMY". 

With the help of these expressions we now rewrite F(o) in terms of the 


total variations: 


F(o) =| do, 5 [56 ys) — 1#9,(6 Myx" 
co k 


=0 

+ fe? TIM see gy] — Lax! (3.122) 

Analogous to the first-order case of Sect. 4, we define the quantities 
fie = 4[T1#s47g My« a TI S446 Oya Pe T1456 y77, (3.123) 


which are antisymmetric; f4}” = —f{’. Because of this property and the 
fact that 
do, ayf"” = [ do, 6,f*™, 
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we can write finally F(c) in the form 


N-1 ‘ 
F(a) = | do,| > 36 ys) — T*8x,), (3.124) 
o k=0 
where 
N-1 5 
TY’ = » (1#a"(6 ys) _ afr} ~ gL, (3.125) 
k=0 


Now, as before, we arrive at the conservation laws by considering 
coordinate transformations 


bx*=a"+ex,, and 5(6y*)=0. 
The corresponding generator is 
~F,,(6) =a*P, + yes, , (3.126) 
where the energy-momentum vector and the angular momentum are given 
by 
P’= do,T”, J" = | do,M**”, (3.127) 
with 
| MY = — MM = THY _ Tary, (3.128) 


If the action integral is invariant under these transformations, we see that 
P* and J*” are independent of the space-like surface a; hence they are 
constants of motion. The differential conservation law is 


6,7" = 0,M**” = 0, (3.129) 


which in turn implies that T”” is symmetric. 


As an example of intrinsic field transformation, consider the gauge 
transformation 


y's = pr etite, and =x, = O. 


iu 
Because now 6(dy*) = ie,A61dy*, we have 


=0 


F,,(a) = [ do, le mgPn(era526%4) = 51Q(0), (3.130) 


where 


k=0 


Niel we 
010) = | do, t= ict, VY TIM#*gMy« (3.131) 


As usual, we can interpret Q as the total “charge” and j" the corresponding 
current density. 
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7. Further Developments and Exercises 


1. Show that Faraday’s law expresses a relativity principle for a moving 
circuit in a fixed magnetic field, on the one hand, and a fixed circuit in a 
moving magnetic field, on the other. 

2. Prove Eqs. 3.17 and 3.20. 

3. What is the time dependence of a spatially constant magnetic field? 

4. Given that E is perpendicular to B and E < B, find a Lorentz frame 
in which E = 0. 

5. Calculate the field of a uniformly moving charged particle by using 
a Lorentz transformation. (In Part II a different method will be used for 
the same problem: see Lienard-Wiechert potential.) 

6. Using Eq. 3.42 find the transformation property of the dielectric 
tensor ¢;; and magnetic permeability tensor y;; of material media under 
Lorentz transformations. 

7. Discuss the derivation of Maxwell’s equation in material media 
from the Maxwell-Lorentz equations by a statistical averaging process.’ 

8. What is the meaning of gauge dependence of the energy of a charged 
particle as given by Eq. 4.158? 

9. For any tensor A*’?, prove the following identity: 


al _ ARPP = A[xP( Ane” — Agve — Aree _ Aver — AcHv _ Aer) 
+ {a <> p}] = (x?A"") , — (x°A"),, (3.132) 


Solution. First calculate the following three divergences with respect 
to first, second and third indices in the form: 


Ox? 
wop _. 4AuVp po _ uvp 
A amA no es A ad 


= (APES). =e, 
aA"), =A tx, AME) 
[agar — AM) ] + AR”, 
Av? = (x?A™ ,) ad a) ae 
=P AVE), AE), HA 
= [x°(A™ — A”) J], + AM 
1J. van Vleck, Theory of Electric and Magnetic Susceptibilities (Oxford, 1939). 


P. Mazur and B. R. A. Nijboer, ‘Derivation of Maxwell's equations from electron 
theory,” Physica, 19 (1953), 971. 
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and 
AvPr = (x? Ae), — x? ae 
= (x7AMP” ) yp — (X°AM ) + Xt AP 
= [x( Auev Are) |e fe ARAL, 
Adding these three divergences, we obtain 
Altes = [x7( A”? a= Ave + AteY = Aral + [x?(Ar™* — Ao) oles 


ei Ae, 
‘and 


AM? .=(oep). 

The difference of the last two equations gives the desired result, Eq. 3.132. 

10. Derive the symmetric energy-momentum tensor T“” from the varia- 
tion principle involving change of boundary, Eq. 3.57’ and the identity, 
E@@3:132% 

Solution. For an infinitesimal homogeneous Lorentz transformation, 

5" = Agh?(x) = 4557, W*. 
If we insert this into Eq. 3.57’, we obtain 
OLSLES{%00,,0" + (GML — Y*"Lio,,x?] = 0; 

or, with Eqs. 3.63 and 3.67, 


Lf"? _ + Te, x"| =0; 
or 


O44, fF? — fore + xT — x°T*)] = 0; 
or, by Eq. 3.91, 
aM = af fm — gee 4. zene _ xoiwe] = 9, 
We now apply the identity proved in the previous exercise to tae) 


to obtain 


A,[x°(T" + TH) — x°( TH" + THY] = 0, 
or 
(x°T** — x°T#?) , = 0, 


where the symmetric energy-momentum tensor is given by 


TS T+ af + ft 4 ft — fear — peony 
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which, by virtue of the antisymmetry of f#°, Eq. 3.75 reduces to 


Th = TO + (fe fra ey.. 3.133) 


11. Prove the algebraic steps leading to Eq. 3.46, the properties of the 
B matrices, their reduction and finally the two-component forms of 
Eq. 3.46. 

12. Prove Eqs. 3.69 and evaluate Ss”” for two-component spinors. [In 
the transformation equation 

g(x’) = AG(L)E"(x), 
one has to express Aj(L) for an infinitesimal Lorentz transformation 
Li = 6 + of, and write Aj(L) = 55 + 4S%""w,,, which gives S.] 

13. Calculate the angular-momentum tensors J*” from the energy- 
momentum tensor T”” for the fields given in Table II. Discuss the 
physical meaning of the angular momentum of the electromagnetic field 
in detail. 

14. Calculate the conserved quantities, as in Table II, for the Dirac 
field. The Lagrangians are given in Table I. 

15. Conformal Invariance. \t has been known for a long time that 
Maxwell’s equations are invariant with respect to a larger class of trans- 
formations than the Lorentz transformations, namely the so-called group 
of conformal transformations. [H. Bateman, Proc. Lond. Math. Soc., 8, 
(1910) 228; 8, (1910) 469; E. Cunningham, Proc. Lond. Math. Soc., 10, 
(1910) 77.] The general conformal transformations have 15 parameters (a 
15-parameter Lie group). They include restricted inhomogeneous Lorentz 
transformations (10 parameters), a scale transformation xj, = Ax, (1 para- 
meter) and the nonlinear transformations 


1 
ee Lt 2g"): 
x ni xiay: 


N =1-—2a"x, + ax’, a 


. 1 ees: 
where a” is aconstant vector. Because x'*x, = N x'x,, this is still a trans- 


formation from flat space to flat space. [A generalization to arbitrary 
metric g”*(x) can be made, which is not considered here.| For a special 
choice a”: (—g/2, 0, 0, 0) and in the nonrelativistic limit c «+ 00, the above 
nonlinear transformation describes the passage to a uniformly accelerated 
frame of reference: t’ = t/(1 — 4g7t?) and z’ = —4gt?/(1 — 4971”). 

Show the invariance of Maxwell’s equations under this larger class of 
transformations. 

16. Derivation of Spin Precession from Field Theory. Consider a spinor 
field y interacting with the electromagnetic field. Using the formalism of 
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Section 4, in particular equation 3.77, evaluate the angular momentum 
J,,. The total angular momentum has two parts, one coming from the 
canonical field tensor T*’, the other from the spin part. Correspondingly 
we may introduce two dual vectors ¢, and W,, by 


t, = SE yvapP I ee pitat 

W,, = ee ae 
Derive the equations of the motion for these vectors from the divergence 
of T*’. Compare the equation of the motion of the spin vector W,, with 
_that discussed in Chapter II, Section 4.(See also Chapter I, Problem 14, 
for the definition of spin from the Lorentz group). 
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PART II 


Interactions 
of Fields and Particles 


IV. 


/ ———— OF MOTION 
AND THEIR SOLUTIONS — 


The description given in the first part of this book of treating the 
particles and fields separately is essentially an approximate one. As we 
have noted before, an accelerating particle produces a field itself which 
changes the external field in which the particle is moving. This new field 
in turn will effect the motion of the particle. Thus, the theory of inter- 
acting fields and particles is intricate and involved, both conceptually and 
in its mathematical formulation. In this part we shall build up this theory 
gradually, starting from simple interactions. We give first a theory from 
the physicist’s point of view, using 6 functions, etc. The question of 
mathematical consistency and completeness will be discussed at the end. 

The three most important aspects of theory of interactions that we 
shall emphasize are (1) the problem of radiation and radiation reaction, 
(2) the equation of motion of particles with radiation reactions, and (3) the 
consistency, completeness and limitations of the classical theory. 

We begin with the Lagrangian form of interactions and the equation 


of motion. 


1. Interaction of Fields with External “Currents” 


Let w(x) be the set of field variables and L® the Lagrangian density 
of the free fields. In the presence of a variable external “current” j(x) 
which acts as a source for the fields the field equations are modified. The 
precise way in which the current j(x) acts, therefore its definition, can be 
given in terms of the Lagrangian formalism. We treat the interaction of 
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the field Y“(x) with the current j(x) by an additional term in the Lagrangian 


density 
L=Li + L, (jx), ¥%(x)). (4.1) 


Later we shall give explicitly j(x) in terms of the particle coordinates. Let 
it be a given function for the time being. 

As an example, we consider the electromagnetic field A a: The simplest 
scalar interaction term with a given current four-vector j"(x) is of the 
form A“j,, and Eq. 4.1 takes the form 


1 
Lisi tie — aan (4.2) 


The field equations corresponding to this Lagrangian, as obtained from 
Eq. 3.52, are 


PY, = — 2 iH), (4.3) 


These are precisely the Maxwell equations, Eq. 3.34, so that the interaction 
written in Eq. 4.2 is the correct one. 

The Lagrangian given in Eq. 4.2 is not gauge invariant. However, 
under the gauge transformations of the potentials A,, the sources that 
produce the field also undergo such a transformation as to produce the 
continuity equation 

Ji, = 0, (4.4) 


and as to make the complete Lagrangian gauge invariant, as we shall show 
later. We note here that under a gauge transformation, Bg Anus 
and under the assumption of Eq. 4.4, the Lagrangian Eq. 4.2 gets only a 
divergence term —1 Ic(Aj*) us 

The total energy-momentum tensor derived from Eq. 4.2 is 


1 
OM = TH+ (4, j%g* — AX), (4.5) 
where T” is the energy-momentum tensor of the free electromagnetic field, 
as we have obtained before, 
TY’ = FMF Y — ghey, (4.5’) 


Because the Lagrangian Eq. 4.2 is not invariant under translations, we 
expect that ©*” does not satisfy an equation of continuity. Indeed, we 
get from Eq. 4.2, using Eq. 3.94, 


: oL hay, se, 
(Ol nS ya ia oA 5 (4.6) 
v jexp 
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On the other hand, from Eq. 4.5, 
v - v 1 veo . r} 
eo” ,.= T+ 7 (Ag + Agj”” = AX j° — A'j%,). 
Comparing this equation with Eq. 4.6 and using Eq. 4.4, we obtain 
FY Se FY =k", (4.7) 


where we have also introduced a new four-vector k” which we shall call 
the force density. The physical interpretation of Eq. 4.7 is as follows. If 
the field is “‘driven’”’ by a current j(x), the energy-momentum tensor no 
longer satisfies a continuity equation, as in the case of free fields. In other 
words, the energy and momentum of the field changes as a function of 
time according to the equation 


P"(o,) — P*(a;) =f dxk’, (4.8) 
R 


where R is the region between the two space-like surfaces o, and a}. Or, 
in a particular frame of reference, 

aP*/dt = | d*xk” = K’; (4.8) 
i.e., the rates of change of the total field energy P° and total field momen- 
tum are given by the space integral of the force density, or the total force 
K’, which is, of course, the analogue of the Minkowski equation for a 
single particle discussed in Chapter II. 

It is instructive to derive the fundamental equation, Eq. 4.7, from the 
definition, Eq. 4.5’, of T”” and the field equations, Eq. 4.3. Note that in 
the above derivation we have not used field equations but a property of 
the Lagrangian that being not invariant under translations. Differentiating 
Eq. 4.5’, we obtain 


Tg! Aateees, IY, .) 


Lae oury op, 
a le = alle tT 


We now make use of the identity, Eq. 3.33, or 


pia el ae 5g aa lil nae Fea) 
to obtain 


1 os vo vy 
To P tse |” SP. 


The last two terms cancel and we get again Eq. 4.7. Q.E.D. 
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Equation 4.7 is not in the form of a conservation law and we have 
seen that the energy and momentum of the field are not conserved. (The 
angular momentum is conserved, because the Lagrangian is invariant 
under homogeneous Lorentz transformations.) If we introduce an energy- 
momentum tensor t"” for the current; ie., the matter tensor, by the 
equation 


1 
HY = —k’ = — . Ey. (4.9) 
we can express Eq. 4.7 in the form of an equation of continuity: 
Pee) ae. (4.10) 


Consequently the total energy and momentum of the field and the matter 
are conserved; they are not conserved separately. There is an exchange of 
energy and momentum between the two systems. 


2. Interaction of Fields with a Particle 


The discussion in the previous section has shown that from the point 
of. view of both field equations and conservation laws it is desirable, or 
necessary, to specify the nature of the current J(x) explicitly. Now, we 
consider the system of fields and particles together when there is an energy 
exchange between them. The current J(x) will be a function of the particle 
coordinates z“(s) and their derivatives. The total Lagrangian can now be 
written in the following form: 


L= LY, We") + LEZ", 24) + Lit, ym, 2", 24, (4.11) 


which consists of terms corresponding to free fields and particles plus an 
interaction term. If we vary L with respect to the particle coordinates, we 
obtain the equation of motion 


mocz# = K*(*), (4.12) 


where the force now depends on the field variables. Similarly, the variation 
of L with respect to w(x) gives the field equations 


A, = f,(z"), (4.13) 


with an inhomogeneous source term due to the presence of the particles. 
A, = 0 are the free-field equations. The two coupled equations, Eqs. 4.12 
and 4.13, then form a complete framework describing the interactions of 
fields and particles. 
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Let us consider a single particle, and let z* = z"(s) be the world line 
of the particle. The current j“(x), as a function of x, is different from zero 
only along the world line. 

For a charged particle interacting with an electromagnetic field, we 
have already introduced a current four-vector, 


oi > pocu", 


which, as a function of x, can be written now as 


7 eo cc[ dsz"(s)d(x — z(s)), (4.14) 
with “_ | 
#2, =1. (4.15) 


First we must verify that, for this particular form of the current vector, 
the equation of continuity, Eq. 4.4, is satisfied, as it should be, because 
the charge e of the particle is conserved. Indeed, 


i= cc | © dsih(s) [ile — 268) 


or, because 
Of (x — z)/dx* = —Of(x — z)/dz", 
and 
(6f/0z")(dz*"/ds) = df/ds, 


+0 


i= —ec[ ds [6(e — x0] = ~eod(x — 28) 


=O050:ED. 
Now we must use the Lagrangian Eq. 4.2 with j* given by Eq. 4.14. 
Under gauge transformation, the Lagrangian changes by a total diver- 
gence term so that the equations of motion are gauge invariant. They are, 
of course, a 
+ 00 - 
FY |= —e | dsz"(s)(x — z(s)). (4.16) 


The fundamental Eq. 4.7 is now 
Te , Ser’ { d8z,(s)0(x —2(s)) =eF""(z)z,(s) | s)=x- (4.17) 


Thus, the divergence of the energy-momentum tensor of the field at x 
is essentially the Lorentz force on the particle when the particle is at the 
point x: z"(s) = x. This in turn is equal to mass times acceleration, mocz"(s). 
Indeed, for a free particle with a Lagrangian 


LG? = moc[(24(0))*1"”, 
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we have the equation 


oO PeL al Hae, 
ee age) Se eer v 
oL ar (= 6z ) a5 (= 6z ) dz’/dt 


and under a translation, dz“ = a" = const., and translational invariance, 
oL = 0, the equation of continuity is 


(mgcz*z") , = 0. 


We can, therefore, define an energy-momentum or matter tensor for the 
single particle by (with correct dimensional factors) 


TY = —moc? j dsz#z"5(x — 2(s)), (4.18) 


with t", = —moc?. We want now to show that the sum of the energy- 
momentum tensors of the field and the particle is conserved. 
From Eq. 4.18 we get 


HY = —moc? | dsz"z" 0,6(x — z) = moc? J ase” o [d(x — z)] 


d 
= myc? | ds rr [2°5(x — z)] — moc? J dsz*5(x — z); 


the first term vanishes, hence 
tH” = — moc? | dsz"5(x — z(s)). (4.19) 
The conservation of the total energy-momentum tensor (7*” + 1’) , = 0 
is now satisfied, in view of the equation of motion for the particle 
ey e vu 
Mince = ‘ Vile oe (4.20) 
The complete action governing both the field and the particle is given 
by 
ane) e 
f Ldx =J moc(2"’)'? dr + = J A,(x)2"(2)5(x — 2(t)) dx dt 
+4§dxF"'F,,. (4.21) 


It can be shown that this action leads to the coupled Eqs. 4.18 and 4.20, 
just discussed. Furthermore, the conservation law derived from Eq. 4.21 
is just 

(Tit ape =@; 


with T*” and t"” given by Eq. 4.5’ and Eq. 4.18 respectively. 
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As a second example, we consider a particle interacting with a complex 
scalar field ¢(x) whose Lagrangian is given in Table I, Chapter III. The 
simplest interaction term in the action is 


J 96(x)5(x — 2(s)) ds, (4.22) 


where g is a coupling constant. The fundamental coupled Eqs. 4.12 and 
4.13 become in this case 


(CX? + m?)$(x) = g5(x), (4.23) 
for the field, and 


moc2" = g J dsd(x) d"[6(x — z(s))] = K*, (4.23’) 


or the particle. It is easily verified that the Minkowski force in the last 
equation satisfies the necessary condition 
K*z, = 0, 
for 
+o 
K*z, = 9 J ds$(x)z,[6(x — z(s))] = g(x)d(x — z)]} = 0. 
—o 

It must be noted that the classical aspect of the scalar field, such as 
the z-meson field, is a much less common experience than the electro- 
magnetic field. Whereas in every-day experience we almost always have a 
very large number of photons and therefore can describe them by the 
classical electromagnetic field (in other words, we see their field aspect), 
we can only deal at present with the particle aspect of the x mesons. It is 
conceivable that, in spite of their massive character, when we must deal 
with a very large number of x mesons, we also should be able to describe 
their behavior classically by a pseudoscalar field. 

The form of the Lorentz force Eq. 4.20 is experimentally well esta- 
blished; the correct form of the interaction of a particle with the scalar 
field is not. The form, Eq. 4.22, above is the simplest possibility, which, 
however, may not be the correct one. In the case of a static field, 6 = $(r), 
¢ =0, Eq. 4.23 reduces to 


(A? — m’)d(r) = — g(r), (4.24) 


and we can give an interpretation similar to the classical gravitational or 
electrostatic field. The scalar fields have a source, the nuclear matter with 
a mesonic charge g, and one can regard the mesonic field as arising from a 
distribution of nuclear matter in the same way as one regards the gravi- 
tational field as arising from a distribution of mass, or the electrostatic 
field from a distribution of charge. The solution of Eq. 4.24 has the form 


oe) = Z ger 
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(see Chapter IV), which falls much more rapidly than the gravitational 
and electrostatic fields which go as 1/r. This is due, of course, to the mass 
of mesons and makes it even more difficult to observe the classical aspects 
of such fields. 


3. Interactions Between Fields 


The description of fields and particles and their interactions discussed 
in the previous section is quite unsymmetrical. The particles are described 
by coordinates z*(s), the fields by functions y*(x). Classically, field and 
particle are two different and distinct concepts; hence their descriptions 
are not on the same basis. Quantum theory, in particular quantum field 
theory, attempts a synthesis between these two concepts. Here, both par- 
ticles and fields are described by functions w(x), in the sense that one 
field (or particle) is the source of the other. In other words, there are only 
fields or particles. 

Although the field quantities y“(x) considered in quantum theory of 
fields are not ordinary functions but operators, we briefly consider in this 
Section the interactions of fields with other fields on a classical level. This 
treatment can be considered as an approximation in some cases and is 
intended as an introduction to quantized field theory. Furthermore, we 
want to show how gauge invariance and the equation of continuity for 
the current j%, = 0 are tied together. 

Let the charged particles be described by a scalar field ¢(x) and as 
the source of the electromagnetic field, and vice versa. The total Lagran- 
gian density is of the form 


L=L0($) + Lo(A*) + L,(¢, A”), (4.25) 


where L,(p) and L,(A“) are the Lagrangian densities of the free scalar 
and electromagnetic fields given in Table I. We determine L, from the 
requirements of Lorentz invariance and gauge invariance. The simplest 
possibility is not a term of the form A*j,, aS one might expect, because 
this form is not gauge invariant. We proceed as follows. We know that 
L(A") in Eq. 4.25 is invariant under the gauge transformations 


A, >A, +A, (4.26) 


The term Lo(¢) is invariant under the gauge transformations of the first 
kind, 


o> ge, 
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where A is a constant. If we now require that L be invariant under Eq. 4.26 
and a gauge transformation of the second kind for $, 
gb > ge AG), (4.27) 


simultaneously, L,(¢, A“) is determined uniquely. The simplest way to see 
this is to note that the expression (id, — eA.) is invariant under Eqs. 
4.26 and 4.27. We thus replace in L(¢) the differential operator 6, by’ 


0, > (id, — eA,), (4.28) 
and obtain 
L= —}F"F,, + 4[(id, — eA,)(— id" — eA")p* — m?oo*}]; (4.29) 
or, separating Lo(#) out, we have 
L = L(A") + Lo(o) — j,A" + 407A, A"oo*, (4.29') 
where 
le ° 
i, = —5 (630 - 6,0) (4.30) 


is the current density. The fundamental coupled field equations deter- 
mined from the Lagrangian, Eq. 4.29, are 


Fe" = —j* + e7A"pg* = — : [o(— id" — eA")p* + $*(i0" — eA")o], 


(4.31) 
and 
[(id, — eA,)? — m?]¢(x) = 0, (4.32) 
in which one field acts as the source of the other field in a not very obvious 
way. 


The Lagrangian, Eq. 4.29, is invariant under the inhomogeneous 
Lorentz transformations which leads to a total energy-momentum tensor 
that satisfies 


THY, = 0. 
The energy momentum of both fields together is conserved, as there is 
exchange of energy and momentum between the fields. 

As a second example, we describe the charged particle by a four- 
component spinor field p(x) satisfying the Dirac equation. The Lagrangian 
is given in Table I, Chapter III. The interaction term in the Lagrangian 
will be obtained again by the requirement of Lorentz and gauge invariance, 


Eq. 4.26, and 
> per AG), (4.33) 


1 ¢ = | in this section. 
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If we replace 0, in the Lagrangian for the Dirac field by the gauge-invariant 
derivative (id, — eA,,) according to Eq. 4.28, we obtain 


L = L(A") + Lop) — eA,(xby"y, (4.34) 
in which L, is again of the simple form A,j”, because, for a Dirac field, 
jt = eby"p. (4.35) 
The coupled field equations are now 
[y"(id, — eA,) + m]y =0, (4.36) 
and 
FO = —j = — ey'y. (4.37) 


In these two examples we have determined the form of interaction by 
the requirement of gauge invariance of the second kind. We arrive at a 
fundamental idea if we present the argument as follows. Suppose we have 
a matter field with a current density of the form of Eq. 4.30 or Eq. 4.35, 
which satisfies the conservation law, J*, = 0, a so-called conserved current. 
We may start with this assumption of charge conjugation. Thus, the matter 
field is invariant under a gauge transformation of the form of Eq. 4.27 
or Eq. 4.33. But the Lagrangian of the matter field alone is not invariant 
under such a transformation, unless we introduce a vector field A,(x) 
satisfying the gauge transformation, Eq. 4.26, to counteract the variation 
of the matter field. Thus, from this point of view, a conservation law, here 
charge conservation, forces us to introduce a vector field through which 
the particles interact. The charge we are discussing here is purely the 
electromagnetic charge. This idea is susceptible to generalization, as will 
be shown below in the discussion of the so-called Yang-Mills field. 


*Vector Fields.’ The electromagnetic field A,(x) is a vector field with 
zero mass which introduces an additional gauge condition. It is important 
in the theory of elementary particles to consider vector fields with nonzero 
mass. In this case there are no subsidiary gauge conditions. 

Let us denote the vector field by $,{x) and its mass by m, and let us 
investigate directly the electromagnetic interactions of this vector field 
with the electromagnetic field A,(x). As before, only the so-called gauge-' 
independent derivatives of the vector field, 


D,$,(x) = (10, — eA,)o,(x) (4.38) 


should enter into the Lagrangian; this guarantees gauge invariance with 
respect to the electromagnetic field. 


1 Sections with * may be omitted at first reading. 
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We assume a Lagrangian that is bilinear in $,, @; and its gauge- 
invariant derivatives. The most general such Lagrangian is of the form 


L = Agx(Di.b*°)(D,o°) + m?o*"6,, (4.39) 


where A®” is an invariant or isotropic tensor, one which has the same form 
in every coordinate frame. We can determine the most general form of 
invariant tensor of rank four by a multiple application of the definition 
of Lorentz transformations (Chapter I); i.e., A”,g’’A,’ = g”’, and obtain 


Ab = %9""Gap + BOSD, + Y95Ie: (4.40) 


where a, B, y are arbitrary scalar functions; in our case these must be 
arbitrary constants in order to avoid the introduction of new fields. 
The above Lagrangian leads immediately to the field equations 


A*D Dyb? = m2bo; (4.41) 


= opp 
or, explicitly, 


oD*D,, + (B + y)D(D.b") — eBO’F pg = md, (4.41’) 
where F*” is the electromagnetic field, and we have used the identity 
D,D, — DD, = ieF ,y. 


It is interesting that, besides m and e, three new constants a, B, 
enter into the theory which must correspond to some properties of these 
vector particles. To see this, we go back to free particles, where the 
equation of motion is now 


a0"O,b, + (B + y)0e(8,4") = m7. (4.42) 


The solutions of this equation can be interpreted best if we make the 
ansatz 

$,= Vv, + o,9, 
where yw, is a divergenceless vector field, 0," = 0 and ¢ is a scalar field. 
Then the field equations split into two parts 


HA 2 = 
(a0*d, — m*)W, = 0, (4.43) 
(a + B + y)0“0,6 = m7¢. 
The first equation represents a vector particle of mass m, the second a 
scalar particle of mass m/(a + B + )'/. It is remarkable that the starting 
Lagrangian, which was written down by relativistic and gauge invariance 


conditions only, contains these two possibilities. 
The appearance of vector and scalar fields together is perhaps of some 
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deep significance. But customarily one restricts the theory to that of the 
vector field by choosing 
a+B+ty=0. 
One of the constants, say «, can be taken to be 1 (this is a change of over- 
all factor in the Lagrangian), then we have 1 + B + y =O or Bf = —(1 + 7) 
so that the Lagrangian contains one more constant in addition to the 
mass, y. This constant must correspond to some intrinsic property of the 
particle which we will now determine. With « = 1, B = —1— y», Eq. 4.40 
becomes 
Aco = 9" Goo — Ge9p + VG5Ge — 9595)- 

The Lagrangian density is now 

L = 4(Dio%, — Dib7)(D"d” — D’") + m?o**6,, 

+ y[(D7.b*")(D,b") — (D*¢*")(D, 6") ]. 


The term in square brackets can be rewritten, and is equal 
ie Kup eve 
= ey VF, (ob Ci) re i) ) ); 


plus two divergence terms, — 0,(G**D,,o”) + 6,(6*"D,”), which may be 
dropped in L (see Chapter III). The equations of motion which follow 
from this final form of the Lagrangian are 
D'F,,, = mo, + iey"F,,, 
D,?, = Do, = ays 

This equation and the current density, obtained according to the general 
principles of the previous chapter, 
JP = ie py(D"d” — D*p") — (D**G*" — D**$**)G, + y0,(O*"S" — $*°G")] 

= ie[(p,D"h” — $ ,D™$*”) + (y + 1)0,(b**$" — O*°O") + G'(D*G** — 

—$*(D,")], 


allows one, then, to interpret (1 + y) as the magnetic moment of the 
vector particle. 


* Yang-Mills and Gauge Fields. We have seen that the conservation of 
charge for a matter field, or equivalently the equation of continuity j%, = 0 
for the current density, introduced the gauge-invariant electromagnetic 
field A,(x), if the total Lagrangian has to be invariant under gauge trans- 
formations of the second kind. 
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There are other conservation laws satisfied by interactions of elemen- 
tary particles besides charge conservation and the conservation laws 
associated with coordinate transformations. 

The elementary particles, or fields of a given kind, occur usually in the 
form of multiplets. That is to say, we have a set of n fields 


vena = 1, 2, tees »”, 


with the same spinor or tensor character. These fields can be considered 
as vectors in a finite n-dimensional vectors space, the so-called internal 
space. According to our general discussion in the previous chapter, we 
can consider now variations of the field quantities corresponding to “rota- 
tions” in internal space; for example, 

Walx) = S.Php(x); (4.44) 
i.e., the transformed fields ; are linear combinations of the old fields at 
the same space-time point x. These transformations are generalizations 
of the simple gauge transformations that we have considered before. In 
accordance with our general theory, the transformations of Eq. 4.43 will 
lead to conservation laws that will be then the generalization of charge 
conservation. 

An important example of such a law is found among the charge multi- 
plets of particles occurring in nuclear forces, or so-called strong inter- 
actions. Here the charge multiplets of the particles are grouped into 
singlets, doublets or triplets, and the corresponding fields transform like 
scalar, spinors or vectors, respectively. Actually, in nature the fields trans- 
form according to the representations of the three-dimensional rotation 
group; this three-dimensional space is called the isotopic spin space, in 
analogy to the ordinary spin. The use of group representations here is 
similar to the use of representations of the Lorentz group, as discussed in 
Chapters I and III, but now it is used for the index « of the fields w,(x), 
instead of the coordinates x. The rotation group admits scalar, tensor and 
spinor representations. 

If the total Lagrangian is invariant under the transformations of Eq. 
4.44, we will obtain a conservation law; the conserved quantity is the 
generator of the infinitesimal transformations corresponding to Eq. 4.44; 
it is called the isotopic spin. There is also a corresponding “‘conserved” 
current density j”, satisfying j*, = 0. 

Of course, it is possible to develop a theory with a more general 
symmetry group in internal space than the three-dimensional rotation 
group we have assumed. The steps of such a theory are the same as above; 
we have to use fields that transform according to representations of this 
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more general symmetry group. There are indications that the elementary 
particles may obey approximately a more general internal symmetry. 

Now, if we start from these conservation laws, such as the isotopic 
spin current, and require the invariance of the total Lagrangian under 
transformations of the type 


dp = gw"(x)o,W, (4.45) 


where o,, «= 1, 2, 3 are the Pauli matrices, which are allowed under 
Jj‘, = 9, then we are forced to introduce three-vector fields bi(x) coupled 
_ to the matter field y to counteract the variation of the free-field Lagrangian 
corresponding to W. This vector field, generalizing the electromagnetic 
field, is called the Yang-Mills field. 

One can go one step more. There are two other conservation laws 
known in strong interactions, the conservation of nucleon number, which 
accounts for the stability of matter, and a new type of charge, the so-called 
hypercharge which accounts for some properties of the so-called new 
Strange particles. These two conservation laws are associated with simple 
gauge transformations of the fields. Following the previous discussion 
they give rise also to two new massive vector fields, one coupled to hyper- 
charge, the other to nucleon number. 

To get a classical feeling of these important questions of the quantum 
theory of elementary particles, the interested reader should consider the 
Lagrangians, the infinitesimal transformations of the fields, generators 
and currents. They are straightforward applications of our general for- 
malism developed in Chapter III. (See exercises at the end of this chapter.) 


4. Solutions of Field Equations: Green’s Functions 


The discussion and the interpretation of the coupled equations of 
motion for the field and the particle, derived in Section 2, will make up 
essentially the rest of the book. This is no easy matter, as the solution of 
the coupled partial differential equations is quite involved. In this section 
we discuss the solution of one of the coupled equations and illustrate a 
powerful method of finding the solutions, namely the method of Green’s 
functions. 

Let us consider first the case of electrodynamics. Equation 4.3, with 
the choice of gauge 


Al, = 9, (4.46) 
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reduces to 
1 
ge alg (4.47) 


We shall find the general solution of Eq. 4.47, and consider the auxiliary 
condition, Eq. 4.46, subsequently. 
If G(x — x’) is a solution of the equation 


T?G(x — x’) = 6*(x — x’), (4.48) 


then the most general solution of Eq. 4.47 is of the form 
1 
A*(x) = A*(x) + - J dx'G(x — x’)j*(x’), (4.49) 


where A*(x) is any solution of the homogeneous equation; i.e., 
07A%(x) a: (4.50) 


Clearly A"(x) given by Eq. 4.49 satisfies Eq. 4.47. We must keep in mind 
from the beginning that the solutions of Eqs. 4.48 and 4.50 are not unique, 
unless specified by boundary conditions. Additional difficulties will arise 
due to the 6-function character of j“(x) in the case of point particles. 

The Green’s function G(x — x’) has a simple physical meaning. It 
represents the field at the point x due to a unit current density at the 
point x’. In Eq. 4.49 we integrate over all points x’ with a weight factor 
j"(x) to obtain the total field due to currents. 

Green’s functions can be obtained by taking the Fourier transform of 
Eq. 4.48. Let 


Ge— x)= Cn* | dke(k)e"*°-~; (4.51) 


we also expand the 6 function’ 
5(x — x’) = (2n)~* j ce eam (4.52) 


1 From the defining equation for the 6 function 
[ ax: 10098(« = x) =f, 
and using (4.51) for f(x), we get 
fx dk eet Bx — x) = 1, 


or 
i) dx! etklz-2) 8(x — x’) = 1; 


ie., the Fourier transform of the 6 function is one. 
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Inserting Eqs. 4.51 and 4.52 into Eq. 4.48 and operating with [_]? under 
the integral (differentiation and integration have been interchanged), we 


obtain 
~ J dkG(k)k2e“MO*9 = f dhe“ BO“*, 


Hence the Fourier transform of G(x — x’) is given by 


G(k) = —1/k?; (4.53) 
thus, by Eqs. 4.51 and 4.49, we have the final results 
G(x — x’) = —(22)~* [ dkeW#O-*)/k2, (4.54) 


and 


AM x) = AM(x) — (2n)~4 : J dk dx'j"(x'e“#O-7/k2, (4,55) 


Remark. We discuss now the above procedure in a more general form and 
context. We consider x and x’ as continuously infinite matrix indices and a 
linear operator #. In Eq. 4.48 Y is the d’Alembertian []?. The underlying 
vector space is an infinite-dimensional Hilbert space whose vectors we write in 
the form of kets; e.g., |x>. The basis vectors |x> are orthogonal, but not 
normalized. We have <x|x’> = 84(x — x’). Note that the x’s are four-vectors, 
and, although we use quantum-mechanical notation, everything is classical. 
The Green’s functions satisfy then, in matrix notation, the equation 


Seb (4.56) 


ie., the Green’s operator G is the inverse of the operator Y. The Green’s 
functions themselves are the matrix elements of G : 1 


G(x — x’) = <x/G|x’>. (4.57) 


Any inhomogeneous equation £ f = ¢, where f and ¢ are vectors in Hilbert 
space, with <x|f> = f(x), <x/¢> = 4(x), has the formal solution 


where fo is any null vector of the operator Y ; Le., Y fo = 0. This is precisely 


Eq. 4.49. 
Assume now that we know a complete set of orthogonal eigenvectors of 
the operator G 
G|k> = 9, |k>, (4.58) 
with 
<xlk> = ¢(x); 


the index k is, in general, continuously infinite, in which case <k|k’> = 8k — k’). 
We expand the vectors |x> in this new basis |k> and obtain 


lx> = J |k><k|x> dk. 
1The quantum-mechanical bra, <|, and ket, |>, and the matrix notation in 


Eq. 4.57 is just a convenient way of writing the eigenfunctions and their expansion; 
it has nothing to do with quantum mechanics. 
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If we use this expansion in Eq. 4.57, we have 


G(x — x’) =f <xIG|k><k|x’> dk = f <xlkDg,<klx'> dk = J dk oY" x). 
’ (4.59) 


This is the important formula expressing Green’s functions in terms of its 
complete set of orthogonal eigenfunctions. In our example, with Y = [7], 


<«x|k> = e7tkz, 
Gi = 3 
and Eq. 4.59 reduces to Eq. 4.54. 


Because Green’s functions have a direct physical meaning as the 
potential induced on a point x due to a unit source on the point x’, it is 
important to study their explicit forms and properties. 

We first discuss the simple case of a static electric field which in itself 
will be a useful result later. Let the potentials A” be of the form 


A* : (p, 0, 0, 0), 


= 0. 


Equation 4.47 reduces in this case to the Poisson equation 


and 


Ag = —= j= -p. (4.60) 


Using the techniques discussed before, we obtain 
G(x — x’) = (22)73 j dkel-&—*)/k?, 


The integral on the right can be evaluated if we put dk = K7dK dQ. 
(x — x’) =r, and k- r= KR cos 8. 


G(r) = (2n)7? f dKe*®? sin 9 dbdd, ~~ 
or 


G(R) = ; on? dK sin (KR)/K = 1/4nR. (4.61) 
0 
This is, of course, the well-known result for the static field at a distance R 


1 A useful formula for the conversion of a class of three-dimensional integrals into 
a one-dimensional one which is frequently met is 


J dk ek" *(K )= * | “aK Kf(K)sin (KR). (4.62) 


Note also the integral | ss dK sin (KR)/K = 2/2. 
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from a unit source. The Green’s function depends only on the magnitude 
R = |x — x’|. The general solution of the Poisson Eq. 4.60 is then 


$e - {More (4.63) 


4n|x — x’|’ 


where $5 is a solution of the Laplace equation, Ady = 0, and will be 
determined by the boundary conditions. If we require, as usual, that ¢ > 0 
as x 00, we find ¢,(x) =0. 

We now solve the Green’s function, Eq. 4.54, in the general four- 
dimensional; i.e., nonstatic case. We write dk = dk° d*k. The integral in 
Eq. 4.54 can be evaluated either by carrying out the k° integration first 
or the d*k integration first. In the former case, 


+0 
G(x = x’) = (22) 4 [ex] ake ee ee -xe _ k?). 


We use the method of analytic continuation on the variable k°. The 
integrand is a function of k° with two poles at 


k° = +(k?)/? = 4K. 


In the lower or upper half k° plane, depending on the sign of (x° — x°’), 
the integrand goes to zero at infinity exponentially. Therefore, we can 
transform the k° integration into a contour integration. Various contours 
are possible giving rise to different possible solutions (Fig. 6) which, how- 
ever, must differ from each other by the solutions of the homogeneous 
equations. Hence the different contours correspond to different boundary 
conditions. 


k°-plane 
Cr 
— ee ee ee oe >— — .— — — — — 
mie +k 
Cr 
——_—— — — — Pe — — — — —. 
Cy 


Fig. 6. Contours for k°—integration for various Green’s functions 
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Retarded Green’s Functions. Let us choose the contour Cp. We must 
close the contour in the upper half plane for (x° — x’) <0, and in the 
lower half plane for (x° — x’) > 0. We denote the corresponding Green’s 
functions by D'*'(x — x’). In the former case the contour does not include 
any poles and the integral is zero; in the latter case the k° integration 
gives just the residue of the integrand at the two poles. 


feae te ERE — k*)yreenani x oer’ 


7 EKO 29°79 (x9 = x), 
where © (1) is the step function: 


At)=0 for t<0, 
=1 for t>0. 


(4.64) 


Thus, with |x — x’] =R, 
D'*"(x — x') = (2n)~42ni J dKK7e'** °° 27 sin 0 do[eK-**) 
+ @ KRP~ONG(x° — x')/2K 
==\(2i) “ni fake Kfe*- > 


ner EK = 29071 9/0 aa x’) j eiKRudy, 
7 an A(x? — x°)[5(x° — x + R) + 5(x® — x” — R)]. (4.65) 


Because R>0, only the second 6 function contributes, in conjunction 
with the 6 function, and we have finally 


D(x — x’) = ere 5(x° — x — |x — x'[). (4.66) 

The form of D‘*'(x — x’) clarifies the name “retarded functions” given 
to these solutions. The field at the “‘point’’ x due to the unit source at the 
point x’ is different from zero only if the time of x’ is earlier than the time 
at which one evaluates the field at x. Moreover, we see from Eq. 4.66 
that only signals travelling with a velocity of light from x’ to x contribute 
to the field. To see this and the invariance property of the Green’s function 
better, we rewrite Eq. 4.65 in the form 


D'*(x — x’) = (1/2m)5[(x — x’)?J0(x° — x’). (4.67) 


The function 6(x° — x®’) is invariant under inhomogeneous orthochronous 
Lorentz transformations, and so is D‘*'(x — x’). The invariant 6 function 
6[(x — x’)?] shows that D'**' is zero unless the vector (x — x’) is light- 
like. If we draw a light cone with x at its center, x’ must lie on the backward 
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light cone. (Fig. 7.) Later we shall make use of Eq. 4.67. In its derivation 
we have used the formula 


6(t? — a”) = x [o(t — a) + 6(t + a)], (4.68) 


x (t=O) 


R=|X-x'| 


Fig. 7. Field point x (t = 0) and the source point x’ at retarded time 
d R 


t =— Ric. 


which we prove here. The right-hand side of this equation is 
(1/2a) J dif()[5(t — a) + 5(¢ + a)] = (1/2a)L fla) + f(—a)], 


and the left-hand side can be transformed as follows: 


| . dté[(t — a)(t + a)] f(t) = —_ dté[(t — a)(t + a)] f(t) 
+ ip ‘ dtd[(t — a)(t + a)] f(a). 


If we set in the first term on the right t + a = y, and in the second t—a=y, 
we get 


f dySL(y — 2a)yIf(y) + f dvE(y + 2ay1fy) = = [a= "2) eee 


Q.E.D. 
Equation 4.68 is a special case of the more general relation 
(a — a,) 
o[g(a)] = >), —— —-, 4.69 
2 Ga a 


where a, are the roots of the equation g(a) = 0. Other useful formulae 
for 6 functions that we need are 


&(—a)= (a); (a) = —4(a)/a, | 


b 


dad(a) = 0(b). 
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Advanced Green’s Functions. We now choose the contour C, in Fig. 6, 
and call the corresponding solution the advanced Green’s function, for 
in this case 


=0 for x°>x” 


D234 ee ‘ 
s ew forex”. 


Thus, D**’ (x — x’) represents a field at the point x at time ¢ due to unit 
charge at x’ at a Jater time t’. This type of solution may seem contrary to 
our physical idea of propagation with increasing time; however, due to 
the causal character of our equations, they are perfectly good solutions. 
This point will be made clearer where we use D**’, later, to evaluate the 
field. 
The evaluation of D**’(x — x’) is exactly the same as before, and we 
find 
D***(x — x’) = (1/22)6[(x — x’)?]O(x* — x°), 
or (4.70) 
D***(x — x’) = (1/42R)6(x° — x” + R). 

Again, D**" fails to vanish only if (x — x’) is light-like and x’ is on the 
forward light cone with respect to x. 

The Green’s functions D'*' and D**” are highly singular functions, zero 
everywhere except on one side of the light cone, where they have a 6- 
function singularity. From their definitions we find that 

D(x — x’) 20 as (x°—x°)> +0, 


OD™/6(x° — x°) +0 as (x° — x) > +00; 


(4.71) 
and 
D(x —x'‘) +0 as (x°—x%)>—0, 
aD**"/A(x° — x°) +0 as (x° — x°) > —0. 
The difference of the two Green’s functions 
D(x — x’) = D'*(x — x’) — D**"(x — x’) (4.72) 


is a solution of the homogeneous equation. This function plays an im- 
portant réle in applications. We get 


D(x — x’) = (1/22) 5((x — x’)?)e(x° — x®), (4.73) 
where e(¢) is the so-called sign function 


+1 if t>0, 
Oe wi if t<0, (4.74) 
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and is invariant under orthochronous Lorentz transformations. Because 
D™*( —x) = D*"@), (4.75) 

the homogeneous Green’s function D(x) is an odd function 
D(x’ — x) = —D(x — x’). (4.76) 


Going back to the integral form of D‘*' and D**’, we derive a useful 
spectral representation for the D function: From 


D=p* — p**¥ = i(2n)~? Ji chet =x aggIRe = x") ae erikixt= xy 


x ex? — x" 2K, 
_ and, using the identity 


et ik(xo—x%) = f dk°5(k° . Re ae. 
we find 


D(x — x’) = i(2n)~? § dke~®O-*[5(k° + K) + 5(k° — K)Je(x° — x)/2K; 
or, with Eq. 4.67, 
D(x — x’) = i(2x)~3 j dke7™*~*§(k?)e(k°). (4.77) 


This equation will allow us to show that the time derivative of the D 
function at the origin is a three-dimensional 6 function. 
oD(x — x’) 
ax? 


+ 


= (0 we | dke™—-*) | Ak°k°S(k?)e(k°). 


The integral over k° gives just unity, because 


§ dk|k°|[6(k° — K) + 6(k° + K)}/2K = 1; 
hence 
OD(x — x')/A(x® — x) leo: = 6(x — x’). (4.78) 


This equation can be written covariantly in the form 


| do, eD(x — x’) fx) 


OX, 


= f(x’). (4.78') 


x’ 8 
Sometimes it may be convenient to introduce a covariant “spatial” 6 
function 6" 

dD/Ox,, = 6"(x — x’) = n"d(x — x’), 
such that 


| do,d"(x — x')f(x) = f(x’) 


when both x and x’ are on the space-like surface o. 
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Other Invariant Green’s Functions. Referring back to Eq. 4.54 and 
Fig. 6, there are two other important Green’s functions. One is defined 
by the contour C;. Depending on the sign of (x° — x°’), we get the con- 
tribution of one or the other pole only. 

D(x — x’) = O(x° ~ x°)D*(x — x’) — O(x” — x°)D~(x — x’), (4.79) 
where D* and D~ are the contributions of the poles at +K and —K, 
respectively, and are given by 


D*(x) = i(2n)~? f d°ke* *e- "2K = i(2n)-? J dke**5(k2)0(k°), 


(4.80) 
and 
D~ (x) = i(2n)~* J d?ke™ *e**°/2K = —i(2n)~* § dke~™*5(k?)0(—k°). 
(4.81) 
It follows from Eq. 4.77 that 
D=D™— D*”=D*+D-. (4.82) 
We also have the properties 
D*(—x) = —D(x), 
and (4.83) 


@nr=D:, 


where the * denotes complex conjugation. 

The other Green’s function we want to introduce is obtained by taking 
the principal value of the k° integration. We shall denote this function 
by D. Because the principal value of a singular integral is equal to 1/2 
the residue of the pole term, we obtain immediately 


D(x) = 4(D™ + D**). (4.84) 
This function is even and real: ; 
D(x) = D(—x) = [D(x ]". (4.85) 


We have introduced four invariant Green’s functions D‘*', D**’, Dp 
and D. All these solutions differ by a solution of the homogeneous equa- 
tion. Thus, in addition to Eq. 4.72, we have the relations 


D™* — D = D/2, 
p*’ — D = —D/2, (4.86) 
D, — D = 3(D* — D~) = (1/2)D"(x). 


In the last equation we have introduced a second (besides D) linearly 
independent solution of the homogeneous equation, D'(x). 
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Green’s Functions for the Scalar Field. The previous theory of Green’s 
functions can be applied immediately to the case of a single scalar field 
$(x), satisfying an equation of the form 


(C0)? + m”)d(x) = f(»), (4.87) 


where f(x) is the source of the scalar field. In effect, this case is simpler 
than that of electrodynamics, for we do not have any subsidiary condition 
as was the case before [see Eqs. 4.46 and 4.47]. 

From the general expression for the Green’s function Eq. 4.59, with 
G = (QO? + m?)~!, we obtain immediately 


G(x — x’) = (2n)~* § dke~#@-*9/(_k? + m?). (4.88) 


The discussion of this equation is now very similar to that of Eq. 4.54. 
The poles in the k® plane are now at 


k° = +(k? ae m?)'/2_ 


This is the only difference. Exactly the same type of Green’s function 
with the same contours can be defined here and will be denoted by 


Aret A24v A ike 
> b) > > 
and the corresponding solutions of the homogeneous equation by 
a ee 
All the relations between these functions are exactly the same as between 


the corresponding D functions. In the spectral representations, 6(k”) must 
be replaced by 5(k? — m7). 


World fine 


Fig. 8. For massive particles emitted from the source the entire 
world line of the source particle inside the backward light cone 
contributes to the field at x. 


Equations of Motion and their Solutions 159 


As a result of the nonvanishing mass, the invariant A functions are 
also different from zero outside the light cone, as we shall see. For example, 
A'** is different from zero inside the backward line cone and A**” inside 
the forward light cone. This implies that the field at the point x is not 
only determined by the intersection of the world line of the particle with 
the backward line cone, but by the whole history of the world line inside 
the backward line cone. Of course, this means that the velocity of the 
signal from the source to the field point is less or equal to the velocity of 
light. Because the signal is carried now by particles with nonzero rest 
mass, this is what one would expect to happen. (Fig. 8.) 

To evaluate these singular functions explicitly, we notice that the 
defining integral [see Eq. 4.80] 


A*(x) = i(2n)7? § dke~™*65(k? — m7)6(k°) 


converges if we replace x by x — it, where € is a four-vector in the upper 
light cone. One can then define A*(x) as the limit, when €-0, of a 
function which is well defined. The integral 


i(2n)~3 f dke~*=-*8)5(k? — m?)0(k°) 


being an invariant function, is a function of (x — i€)? and m only. Hence we 
can evaluate it, for example, by taking x = (x®, 0, 0, 0) and € = (€°, 0, 0, 0). 


A*(x) = lim i(2n) ~3 | dke ~i#°°-i5(,2 — m?)6(k°) 
g2+0 


k? dkd[(k2 — m?)!/? — ky 


= lim any? | abrerwor- 94° | 2n : 


£9-+0 


= lim iany*| dk(k2 — m?)1/2 9 iK(=? 18) 


Gua, m 


mm? HY(m(x? — it?) 
= i— lim =e OMEEEUS 
87 p49 © M(x — tom) 
where H!) is the Hankel function of the first kind." In an arbitrary frame, 
(x° — i€°) must be replaced by [(x — it)7]'”. 


1 We use the general formula 

2i(z/2)” it ize, 2 -y— 
—=— 12 — 1)*-'dt, Imz>0, Rev < 1/2, 
wget 


(i.e., upper sign for Hy), lower sign or Hy'?)). [H. Bateman, Higher transcendental 
function (McGraw-Hill Book Company, Inc., 1953) Vol. II, p. 83.] 


A, — Se 
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Using Eqs. 4.82 and 4.83, we obtain 


(2 2 fF Em(x2)'2] 

A(x) = At + A7 =2 Re A*(x) = a(x) O(n) | 
at im? HP Tm(x?)"/7] 
A'(x) = 2 Im At (x) = rr Im eC 


The second term in A(x) and A’(x) can be expanded if m(x*)!/? is small, 
and we find 


5(x?) 0(x?) = x?m* * ] 
A(x) = ar e(x) + oe ri é(x) + 7) e(x) + 4 
and f 
ee ae a ae 2) 2\1/2 _ | 
A) = — 5a + aa | men? m(x*) 5 + : 


Comparing this with Eq. 4.73 we see that the first term of A(x) is equal to 
D(x), the second term is nonzero for x? > 0 and shows the behavior of 
the function inside the light cone. 

There are invariant Green’s functions for other field equations; they 
can be discussed, in principle, along the same lines. 


Cauchy Problem. For the general solution of the inhomogeneous field 
equation, we need to know the solutions of the homogeneous equation; 
see, for example, Eq. 4.49. We have indicated that the solutions of the 
homogeneous equation will be determined by suitable boundary condi- 
tions. In connection with this, we give now a general procedure for writing 
down these solutions. 

The problem is to find the solution of a linear, second-order, partial- 
differential equation everywhere, if the function and its first derivative is 
given on a space-like surface. 

For the linear hyperbolic differential equation, 


O74" =0, 


the solution of the Cauchy problem can be written in terms of the D 
functions discussed above: 


A*(x’) -| da ,[A"(x) 0’D(x — x’) — D(x — x')A"(x)], (4.89) 


where A" and A“ are the given functions on the surface o. Indeed A*(x) 
given by this equation satisfies 24" = 0, because D itself is a solution 
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of this equation. We must show now that on the prescribed space-like 
surface o, A"(x) and its derivatives take the prescribed values. Noting that 
D(x — x’) = 0, if x’ is on the same surface o as x, we get 


a 60 | aoe = do,A¥(x) d’D(x - De) cass 
which, by Eq. 4.78’, is precisely A“(x’). Similarly, 


AMX ete = -| do,[A*(x) 0°°D(x — x’) — 6°D(x — x')A"*(x)]l x20 
which reduces to A*’*(x’). 
In a similar way, the solution of the Cauchy problem for the scalar 
field, 
(C0? — m)(x) = 


is given by 
p(x’) =| do,[A(x — x')G"(x) — 6(x) d"A(x — x’)], 


where $(x) and $“(x) are the given values of the function and its derivative 
on the space-like surface o. 

The solution to the Cauchy problem is unique, for if we had two 
distinct solutions, their difference would be one also. But this difference 
and its derivative vanishes everywhere on a space-like surface. Then the 
second-order differential equation [J?A“ =0 or (LJ? — m?)¢ = 0 shows 
that it must vanish everywhere in space-time. 


Solutions of the Inhomogeneous Equation. Having discussed the Green’s 
functions and the solutions of the homogeneous equation for a prescribed 
data at a given time (space-like surface o), we can write now the general 
solution of the inhomogeneous equation, which is really our problem. 

For electrodynamics we go back to Eq. 4.49. Let us choose the Cauchy 
data to be on the surfaces t= —oo or t= +00. Let the corresponding 
solutions of the homogeneous equation be denoted by A,,(x) and A,,,(x), 
respectively. For these two cases, we must use the retarded and the 
advanced Green’s functions, and we get 


6(x® — x® — |x — ’ 
A"(x) = Af,(x) + =| ) ia") dx’, 
ns (4.90) 
AMS) = Ag) + | al Meat ox) = *'D inc) dx 
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In the first case, we determine the field at the point x from the past, 
starting with the initial value at t = — oo and in the second case, from the 
future backwards starting with its initial value at t= +00. Because the 
development of the system is causal, these two methods are equivalent. 

The two solutions A‘,(x) and A%,,(x) are related by a third solution 
of the homogeneous equation; from Eq. 4.72, we get 


Ad (x) = AR (x) + . | D(x — x')j"(x') dx’. (4.91) 


Af measurements are made at distant past and distant future, where the 
field is undisturbed, and then a disturbance is introduced at finite times, 
as in scattering experiments, the integral in Eq. 4.91 determines the net 
effect of the interaction. In quantum phenomena, this is the only accessible 
maseurement, and the “ratio” of A,,, to A;, is the so-called “scattering 
matrix’? : 


Abu(x) = SAf(x). (4.92) 


Thus, even classically, it is meaningful to introduce the scattering ‘‘matrix”’ 
which is well defined by Eqs. 4.92 and 4.91. 


Subsidiary Condition. For the scalar field interacting with a current, 
the solution given in the previous paragraph completes the problem. 
However, in the case of electrodynamics, we have worked in a particular 
gauge such that A”, = 0. 

A, = 0. 


It will be shown now that if this subsidiary condition is satisfied for 
the initial free fields A¥ (x) or A“,,(x) in Eq. 4.90, then it is satisfied always. 
Let us assume now that 
Ainn =9, — Abuen = 0; 


then, by Eq. 4.90 


adv 


adv 
Dl ODIO X) races, ela CDi eee 
Ai [OPE ce a = 2 [ne dx’, 


or, integrating by parts 
AMS == 1 dx’ d pret \j4#y! 1 ’ Ee "\s ’ 
ae x’ aoa SD) dx'D™"'(x — x’)j*(x'). 


The second term vanishes by the equation of continuity j ‘, = 0. The first 
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term can be transformed into a surface integral, passing through the 
point x, 


il fee adv 
At, = — | do, D'* (x — x’)j*(x’), 


adv 
which vanishes, because D'*'(x — x’) = 0 for space-like separations. 


An alternate simple proof is the following: Let A“, = x, which satisfies 
Ox =0. 


Now, if x and y,, are zero on a space-like surface, then y = 0 everywhere. 


5. Further Developments and Exercises 


1. Using the invariance of the Lagrangian for the interaction of fields 
with external currents and particles, show that the total angular momentum 
is conserved. 

2. Using the action integral, Eq. 4.21, for a charged particle inter- 
acting with an electromagnetic field, derive the coupled equations of 
motion and the conservation law for the energy-momentum tensor. 

3. Consider the force between two uniformly moving charged particles 
(for example, one moving along the x axis, the other along the y axis) 
and show that the law “action is equal to reaction” does not hold and 
interpret this result using the conservation of total momentum of the 
particles plus the field. 

4, Evaluate the energy-momentum tensor 7%” of interacting scalar and 
electromagnetic fields given by the Lagrangian, Eq. 4.29, and interpret the 
conservation of total energy momentum in terms of individual energy 
momenta of the fields and the energy-momentum exchange between the 
fields. 

5. Show that Eq. 4.36 is the most general isotropic tensor of rank 4. 

6. Evaluate the current vector and the energy-momentum tensor for 
vector fields, and discuss their interpretation. 

7. Show by the method of Chapter I, Sect. 5, that the 2 x 2 unitary 
unimodular matrices u form a two-dimensional representation of the 
three-dimensional rotation group. Therefore, a field that is a two-com- 
ponent spinor in the internal space transforms as 


Wo a Ug Wp. 


8. For the isospinor field of Problem 7 and the isovector field trans- 
forming as 


Wo = Revs, 
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where R is the three-dimensional rotation matrix, calculate the generator 
of the infinitesimal transformation, assuming the Lagrangian is invariant 
under these transformations. 

9. Discuss the introduction of the Yang-Mills and the gauge fields, 
starting from current conservation and gauge transformations of the 
second kind of the type of Eq. 4.50. 

10. Prove Eq. 4.69 by first showing that 


5(g'(a,)(a — a,)) = 5(a — a,)/9'(a,). 
11. Relate the principal-value integral to the residue of the pole, and 
- prove Eq. 4.84. 
12. Solve the equation (A? — 4)¢(r) = p(r) by the method of Green’s 
functions. In particular, find the scalar mesonic field of a point nuclear 


charge. 
13. Find the Green’s functions for the Dirac equation. 
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AND RADIATION REACTION 


In this Chapter we concentrate, in detail, on the electromagnetic radia- 
tion. First, the field of a moving charged particle is discussed; then we 
define the radiation field and investigate its properties. This is followed by 
a discussion of the effect of the radiation field on the charged particle and 
the modified equations of the motion of the particle with radiation re- 
action which present unusual features. Additional requirements must be 
used to obtain physically meaningful solutions. Finally we discuss the 
classical mass renormalization and the problem of the electromagnetic 
nature of the mass of the electron. In the application of the results to the 
real physical situation, particularly when one speaks of the nature of the 
electron, quantum effects must be taken into effect. But the problems dis- 
cussed here are fundamental for the consistency of the classical theory and 
what we may expect when the theory is quantized. 


1. Radiation Field of a Moving Particle 


Lienard-Wiechert Potentials 


Let us put the external free field Ai, (x) = 0. We want to evaluate the 


out 


field produced by the particle alone whose current density is given by 


i =ec [#0 5(x — z(s)) ds 
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where 2'(s) is the four-velocity of the particle and s the proper time. The 
field can be evaluated from Eq. 4.90 and we get 


A" = e ff D'"(x — x’)2"(s) 6(x’ — 2(s)) ds dx’. (5.1) 
We use Eq. 4.67 and carry out the x’-integration 
+ 00 
A(x) = = | ds S[(x — z(s))?]2"(s)0(x° — z°). (5.1’) 


The argument of the 6-function, (x — z(s))?, has two roots with respect 
to s; the 9-function picks up the root with x° > z%s). If we note that 


= [(x — 2(8))?] = —2(x — 2(s))"Z, ee) 


and use Eq. 4.69, we can integrate Eq. 5.1’ and find 


Ht 
Al) = aa 2 


4n (x — z)’z, G3) 


These are the Lienard-Wiechert potentials. The right-hand side has to be 
evaluated at the time so such that x — 2(59) is light-like and x° ~ z°(so) 
= R> 0. This is just the point where the trajectory of the particle cuts 
the light cone with vertex at x. (Fig. 9.) Only from the point z(sy) can a 


z(s,) 


z#(s) 


Bice) 


Fig. 9. World line z#(s), source points for retarded and advanced 
potentials and the velocity z+(so). 
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light signal reach the point (x, f): The three-dimensional distance 
between particle and the field point is |x — z(s)| = R and the time 
interval t — t'(so) = R/e. 

Had we used an advanced function the only point of the trajectory of 
the particle which contributes to the field at x would be 2(s,) (Fig. 9). 

In the case of the scalar field, the Green’s functions Ate (x — x’) are 
also different from zero inside the light cone. Therefore, referring to 
Fig. (8) the whole trajectory inside the light-cone contributes to the field 
at x. This corresponds to signal propagating with a velocity less or equal 
the velocity of light. In quantum mechanical language, if light signals are 
interpreted as exchange of massless photons, in the case of scalar field we 
can talk of exchange of particles with mass, mesons. 

We now write the Lienard-Wiechert potentials, Eq. 5.3, in a more 
familiar form. Introducing the velocity of the particle v, the distance 


R = x — 2(5Sp), n=R/R (5.4) 
we have 


Ze ilee ee 


dt 
(x — z)’z, = (Re — R-y) as° 


and Eq. 5.3 can be split into a scalar and vector potentials 


- @ 1 
$= GRO) i 
e B ; 
where 
B = v/c. 


Note that the velocity of the particle refers to the retarded time 
t'=t—R/c; and R is the distance between the field point x and the 
position of the particle at the retarded time ¢’. 

As a special example, we consider a particle at rest. 


2": (s;0;,0,,0) (5.6) 


z":(1, 0, 0, 0) 
and we obtain 
A=0 and ¢@=e/4nR. 


This is the simple Coulomb field. 
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The Field Tensor Fv’ 


In evaluating F*” it is more convenient to go back to the integral 
expression, Eq. 5.1, rather then differentiate the result, Eq. 5.3. We have 


[D'(x — z)]Z". 


v 


0 

H,Y, = d. 
AY "aie | Ss ae 

Since D‘*'(x) is a function of x?, we get 

ep = (x — z)*] 
Abe? = d sit 
: “| *a@—z)] ox, 

dD ds 
tiie 
| ane 


or, using Eq. 5.2, 
2 von ret 
Ab =--—e ds ae hi, ap" a 
(x — z)’z, ds 
We integrate this equation by parts 


= vsuy to = vom 
AMY = ~e| D“ i as | +e [aso —z) es | ze | 


(x — z)"Z,|_. ds |(x — z)*z, 


The first term is zero. In the second term we replace D'* by 5[(x — z)?] 
and use again Eq. 5.2 and integrate 


Any — & 1 d = ~ | 


~ On (x — 2), ds L(x — ra) 


With 
(x = z) Ze — R (5.7) 
(x - 2)", = 
the final result is 
u 
Fe = = 3 [(x — 2)!2°R — (x — 2)'2°Q + (x — 22” 


— (x — z)"2*R + (x — z)’2"Q — (x — z)’z*]. (5.8) 
Before we discuss this equation in detail let us look at two simple 


examples. For the Coulomb-field with Eq. 5.6 and R=x°—2z° and 
(x — z)* = (x" — sd"°) we obtain 


é 


FY = 
4nR? 


[Ox — 5 54%) 5° — (xY — 5 5°) 500] 
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or, 
e 


Bi 0) E = — ——R. i 
4nR> ed) 


As a second example, consider a uniformly moving particle 


. f dt 
= (Ve z=(c, v, 0,0) = x(1, B, 0, 0) 


then 

R = (x — z)°z, = (x° — oo! + (x! — z!)v gi 

Ss 

and we get 

e 

FP = L(x? — 2°) 58° S* By + (x9 — 2°) 51 3” — (u —)] 
= — (1 — py! 5° — 5°! 5H) (5.9’) 
4nR? ; 

or 


oa mes — bd = = 
E, is 4nR2 (1 B); E, E, 0. 


The field F*” given by Eq. 5.8 consists of two parts: 

(a) Field depending on the velocities z alone and proportional to 
1/R?—near field 

(b) Field depending on the acceleration z and proportional to 1/R— 
far field or radiation. 

Let us look at these two terms in terms of the electric and magnetic 
fields. 


1/R?-field: 
Ba) = 7 ae 
B(é) = 7a (a (5.10) 
1/R-field: 
Ps res : nies ; = ae (5.11) 
pe) = 2B x B+ min B xB) 


4ncR (1 — n-p)? p 


where B = (1/c)v and the velocity v refers to the retarded time. 
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The radiation field, Eq. 5.11, simplifies for a slowly moving particle, 
B <1, to 
e 


4ncR IOs | 


E(z) = 


MY hod: 
BO = 4ncR 


=n x E(Z) 


showing that electric and magnetic fields and the direction n which con- 
~nects the field point to the position of the particle at retarded time are 
mutually orthogonal. 
The Poynting vector of the radiation field is given by 


nx (5.12) 


2 
ee = é 
SG). BO e e x B)] x [n x 7. 
The energy flux is radial and is in the direction R. The value of the energy 
flux in the solid angle dQ is independent of R and equals 


2 
Gyre dQ(n x )?. 


To find the energy radiated per unit time we integrate over the sphere of 
radius R. With n x B = (1/c)¥ sin 6 we have 


2 
[s-a- are Han [ sin 6 do 
_ 2 e? s? 
34703 


(5.13) 


In the following sections these and other properties of the radiation 
field will be investigated in detail and quite generally without making 
the simplifying assumption, 8 < 1, used above. 


2. Properties of the Radiation Field 


Null Fields 


We start the discussion with the field F#” produced by a moving 
particle given by Eq. 5.8. This field satisfies the free Maxwell equations 


OF’ /dx” = 0 
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everywhere except on the world line of the particle. Consequently we have 
the conservation law for the electromagnetic energy momentum tensor 


dT” Jax” = 0 


everywhere except on the world line. 
It follows from Eq. 5.8 that the field of the moving particle outside 
the world line satisfies the properties 


Eger, = 0 (5.14) 
and 
F yy?™"(x — Z)q = 0. (5.15) 


These relations follow from the fact that e””’ is antisymmetric whereas 
the products F,,F,, and F,,(x — z), are symmetric in the indices (uc) and 
(vo), respectively, as can be seen by Eq. 5.8. It is convenient to introduce 
the dual electromagnetic field FF, defined by 


a a (5.16) 
The Eq. 5.14 is then 
Fi ** =. (5.14') 


This quantity is one of the invariants of any electromagnetic field and 
Eq. 5.14 indicates that the vectors E and B are orthogonal. The second 
property, Eq. 5.15, shows the orthogonality of E and B to the vector 
(x — z). Thus, these two properties are quite general for the total field F“” 
of the particle. 

The field far away from the world line, namely the radiation field 
which depends on the acceleration Z and varies as 1/R with the distance, 
has additional special and important properties. 

The radiation field is given by’ 


Pov) = (Z) Fo IR = 272" - (x — 2270 — HO Many GD 


where (1 + v) means two additional terms with p and v interchanged and 
Rand Q are given by Eq. 5.7. 

We can now verify from Eq. 5.17 that the radiation field has the 
following properties in addition to Eqs. 5.14 and 5.15. 


FAO?) =0 (5.18) 
and 
FREKx = 2 = 0, (5.19) 


1 In Section 5 we shall give a covariant definition of the radiation field and a close 
expression for it on the light cone. See Eqs. 5.67 and 5.71. 
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which follow immediately, if we note the relations 
(x — 2(So))? = 0, =) sand 2°70 (5.20) 
The Eq. 5.18 expresses the equality of the magnitudes of the vectors 
E and B. 
Any electromagnetic field for which the two invariants (see Eq. 3.38) 
F,,F"” and F,,F**” are identically zero as in the above case may be defined 


as a pure radiation field. It is also called a null field. Now we discuss more 
closely the physical significance of the radiation conditions. 


Plane Wave Decomposition of the Radiation Field 


Consider first a plane wave moving in a given direction, say x'. The 
potentials A” depend on the time coordinate and one spatial coordinate, 
x', only. The subsidiary condition A%, = 0 becomes 


AS, + Al, =0. 
Let us again choose the gauge 
A°=0 and V-A=0. 


Hence A’, = 0 and the field equations reduce to the one-dimensional wave 
equation 


A oo + Ait = 0. 
For the A'-component we obtain 
- A'og = 0, 
or 
A‘, = const. 


A constant in the last equation which is different from zero gives a constant 
electric field which does not represent a wave. We, therefore, choose 


AS= 0: 
For the A? and A? components of the field we have the equations 
Ao al A’, i 
and 
Ao a Avi q 


Hence both A? and A?® are functions of the difference (x° — x!), or 
A = —A?, and A% == —A} . 


From these relations we can obtain the non-vanishing field components 
which we show in matrix form. 
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0 0 -E, -E; 00 B, 
= 0 —B,; B, 


(=) 
(=) 
oO 
o 
om & 


The field invariants are now easily calculated. We have 


E-B = A%,A\, i A» A, = 0 


and 
E? — B? = 0. 
Thus, for a single plane wave, we have the well-known properties 
E1Blin 
and 
|E| = |B 


where n is the direction of propagation. However, these relations are still 
not sufficient to fix E and B uniquely. At a given point x the vectors E, B 
can still change as a function of time in a plane perpendicular to n. Thus, 
plane waves may still be distinguished, for a given n, E and B, by the 
direction of the vector. This directional property has observable conse- 
quences. One can therefore introduce a so-called polarization degree of 
freedom of the radiation field which then describes this additional direc- 
tional property of the field. Since E and B vary in a plane, two numbers 
are sufficient to describe the polarization of the radiation; in other words 
the radiation field has two degrees of polarization. Still in other words we 
shall refer to the transversal character of the electromagnetic radiation 
field. 

We describe the polarization covariantly as follows. 

Consider the monochromatic plane wave a 


A*(x) = C# e7 / (5.21) 
with k? = 0 which is a solution of the wave equation (]?A“ = 0, where 
C* is a constant vector. The subsidiary condition A*,, = 0 implies 

k*C, = 0, (5.22) 
i.e., the amplitude C“ is orthogonal to the wave vector k,,. It is convenient 
to choose a k-dependent basis in the Minkowski space, i.e., four 4-vectors. 


(See Part I, Eq. 1.14.) 
CV 012, 3) 


with components e” such that the components of k” are always 
ke) = (k®, k°, 0, 0). 
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The amplitude C* can be expanded in the form 
Ce e bee 


Now the condition, Eq. 5.22, gives 
Y pete) = k°(b = Bb) =0 
7 


or 
bO = HM, 


The magnitude of b™ is then 
b? = —(b2” + py, 
The electromagnetic field corresponding to Eq. 5.21 is 
FY = i(C"k” = Ck) e—. (5.23) 


This equation taken in the basis e{” clearly shows the transverse character 
of the field: The components of E and B in the direction of k vanish: 


foe 0) er 10) 


Equation 5.23 is a special case of a more general and useful theorem. 
Any null field can be written in the form 


FM = 1*(x)o"(x) — I(x) E*(x) (5.24) 
where the vector fields /“(x) and €(x) are such that 
I? =0 
and (5.25) 
PG, = 0; 


hence ¢, is a space-like vector, ¢? <0. With Eq. 5.24 the two-conditions 


F’F,, =0 
and 
Per 0 
are satisfied. 
For an arbitrary field satisfying the field equation (]?A" = 0 and the 
subsidiary condition A“, = 0 we make the Fourier decomposition 


AM(x) = (2n)~* § a'(k) e-™ dk. (5.26) 


From the reality of the field A"(x), the complex Fourier components 
a"(k) satisfy 
a,(—k) = aj(k). (5.27) 
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If we insert Eq. 5.26 into the field equation and the subsidiary condition, 
we obtain the conditions 


k? =0 
and (5.27) 
a,(k)k* = 0. 
The Fourier decomposition of the field tensor F“” is therefore 
F(x) = ((2n)~* f i(a"(k)k’ — a’(k)k") e~ ™ dk. (5.28) 


Energy and Momentum of the Radiation Field 


The energy momentum tensor 


TH” = FURY — gL, 
with 
= —-]F"F, 


can also be written in the symmetric form! 
TY = 4(FY°'F + F**°F*’). (5.29) 
For a single plane wave with F“” and F*#” evaluated above we get 


E> En eeebens | En; 


ei E? 0 0 
= —E,E, —B,B, 
—E? 


where n (,, 2, 3) is the direction of propagation of the plane wave. We 
see by comparison with Eq. 3.95 that the Poynting vector is 


S=cE’n 
and the momentum density of field 
=— E’n. 
of 
In covariant notation the energy momentum tensor for the mono- 
chromatic wave, Eq. 5.21, is given by 
r= Chee (5.31) 
1 For two tensors F#* and G“” with their duals F*,» and G*,» we have the general 
formula 
4g"Fo°G op = FUG, — G*HOF 5°, (5.30) 
In Eq. 5.29 we have used a special case of this formula with G4” = F’, 
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which in view of the light-like momentum vector, k? = 0, satisfies the 
relations 


rk =O, 
EES! (5.32) 
and 
Te Tg 


In general, for any null field given by Eq. 5.24, the energy momentum 
tensor can be written in the form 


T(x) = €7P)P'R) (5.33) 
with /? = 0. Clearly, 
Tl, =0 
and also the two last properties in Eq. 5.32 are satisfied. The matrix T 
has, thus, a light-like eigenvector with the eigenvalue zero. 


For a general radiation field given by its Fourier decomposition, Eq. 
5.26, we obtain 


T’ = —(2n)78 fe ®@*®) dkdk' {fa"(k)a,(k')k°k” 
— a®(k)a*(k’)k°k, — a®(k)ag(k’)k"k” + a%(k)a’(k’)k*ki] 
+ 2g" La%(k)ag(k')k?k, — a%(k)a"(k')k kp} (5.34) 
In the evaluation of the energy-momentum four vector 
P’ =| door? = far” 


The x-integration can be carried out giving 5(k + k’). Then k +k’ =0 
implies kj = +k, or 
k= —k’. 
Thus, we find 
P®=(2n)"? ¥ J dka°(k)a*(k)k?, (5.35) 
o=2,3 


where in the basis e;) introduced above we have to sum only over the 
transverse indices of a’(k). 
Equation 5.35 expresses the total field energy as a sum over plane 
waves each with two transverse degrees of polarization. 
We remark that the Fourier decomposition, Eq. 5.26, can also be 
written as 
A"(k) = (2n)~* § dke™**™* 5(k?)A"(k) 


where the coefficients A“(k) are different than the a"(k) in Eq. 5.26. The 
same results can be obtained by using this expansion. 
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3. Canonical Formalism for the Radiation Field in Terms 
of the Transverse Vector Potential 


Finally we make use of the transverse character of the electromagnetic 
field to give a canonical form of the field equations which eliminates the 
redundant components from the beginning. 

Any vector A can be split in the form 


A=A™+ Az (5.36) 
where, by definition, the transverse component satisfies 
V-A’=0 (5.37) 
and the longitudinal part is soleinoidal 
Vx A=0, (5.38) 
and, therefore, must be the gradient of some potential 
AL = Vy (5.39) 
The components A’ and A” are “orthogonal” in the following sense 
J d?xA7-A* = —[(V-A7)W d*x =0. 


The subsidiary condition A4,=0 in the gauge A“(0, A) becomes the 
radiation gauge 
yoA = 0) 
which implies that 
V‘A-=0 or V*~=0. (5.40) 


The only solution of the Laplace equation which is regular everywhere 
(in the absence of sources) is a constant. Hence we can put 


AL =0 (5.41) 


Because 


10 
a A 
= cot 


we can make a similar splitting for E into its longitudinal and transverse 
components and obtain 


and (5.42) 
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The field equations are now 
(7A? = 6°, A™ + 8°0,AT = 0; (5.43) 


no subsidiary condition is needed, since V-A’ = 0 by definition. 
From the second order Lagrange equation, Eq. 5.43, we can pass to 
first order canonical equations 
E = —@,A™ 
and (5.44) 
Oo9E = —(V°V)AT. 


These Hamilton’s equations are gauge invariant, because A’ is. Under a 
gauge transformation A” — A” + A” the new potential 


A’ = A™ + AY 
is related to the original one by 
A’ =A7™+VA. 
However, since V x VA = 0, we have 
AT =A" and Aj= doh: 


In Eqs. 5.44 E and A’ may be considered as the conjugate set of 
variables. This is an important point for quantization. 

The canonical equations, Eq. 5.44, do not look manifestly covariant; 
E and in particular A? do not have simple transformation properties. The 
equations are, however, relativistically invariant. 

The essential point in the derivation of the canonical Eqs. 5.44 was 
the use of the radiation gauge V-A = 0. Inorderto have the same canonical 
equations and the same physical interpretation, we must impose V‘A=0 
in every frame. This is possible only if we make, together with every Lorentz 
transformation, also a suitable gauge transformation to ensure the validity 
of the radiation gauge. Let 


Aix’) = Lj A(x) 


be the transformed field; the field in the new frame, however, will be 
taken to be 
A,(x’) = L,” A(x) + A,,(x), 
where we choose the gauge A in such a way that 
ps (4) (he. OX 
V-A = 04,/dxi = —(L,"A — jot=9, 
xl Xx Ay k v(x) Ox} Se Ax ax} 0 
Hence we find 
A, = —L,’A,/. 
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In mathematical language the invariance of the canonical Eqs. 5.44 
involve more general representations of the Lorentz group; E and A? 
are geometrical objects transforming according to this, what we may call, 
the gauge dependent representations of the Lorentz group. If we denote 
these representations by D(L, A) the group property reads now 


D(L,A,2)D(L,A}) a D(L, A), 


the identity element being D(/, 0). 

Quite generally, if a theory admits more than one invariance principle, 
or group of transformations 7;, we can use joint representations D(T;). 
For example, every time we make a Lorentz transformation we can also 
simultaneously transform our physical quantities under 7;. This may lead 
to simplified theories as was the case for the canonical treatment of the 
electromagnetic field. 


4. Energy and Momentum Radiated 


In this section we discuss the energy balance between the particle and 
its field and energy and momentum lost by radiation to infinity. 

In the wave zone the invariant F’’F,, vanishes and the energy-momen- 
tum tensor is given by the simple expression 


T"(Z) = F(z) F,"(2), 
where F“*(z) is the far field depending on the acceleration and given by 
Eq. 5.17. A straightforward calculation gives 
T(z) = (e/4n)(1/R°){[R(x — 2)*2* — Q(x — 2)"2? — (uea)] 
x [R(x — 2),2” — O(x—-2),2” — (6 BT} |s=s5 
where again (<> v) means terms with p and v interchanged; Rand Q have 


been defined by Eq. 5.7. 
Noting that 
(x — 2(So))? =0 
and 
2(s$)Z4(So) = 0 


e\? (x — zx —2)" (2 5 

EVE S\N es pe SS ae ee Fac bal 

bisa (5) R? (2 +R? 

We show first that 7“"(2) is positive definite as it should be. The 
acceleration 7 is a space-like vector (in the rest frame of the particle 


we find 
(5.46) 


S=S0 
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2: (0, 2)), hence z? <0. Furthermore, we decompose (x — z) into a time- 
like and an orthogonal space-like vector (see Eq. (1.9)). Let 


(x —z),, = RZi+y (5.47) 
where 
y?><0 and zy=0. 
Indeed, we obtain 
(x — 2)"Z,=R>0 


and 
Q=(x — z)’Z, = y%Z,. 
If we form 
- O=(x — 2(so))? = R27? + y? =R? 4+ y? 
we find 


y? Paha R?2. 
Now from Schwarz inequality (Chapter I), 


Q’ = (y"Z,)’ S zy” = R?|2|? 
or 
O/Rmea2\y- 
This equation shows that T’"(Z) is indeed positive definite. Moreover, T*” 
is the greater as Q is smaller. We obtain the maximum of T“” where Q 
vanishes, i.e., when 
(x = Al ae = 0; 


in the rest frame, (x —z)*Z=0; i.e., in a direction (x — z) perpendicular 
to the acceleration the energy momentum tensor has its maximum. 
From the conservation law 


HY oftv 
T it vow 


where t*” is the matter tensor of the particle (see Eq. 4.10), we now evaluate 
the energy and momentum given from the particle to the field in some 
space-time region v including the particle world line: 


AP# = -[o, dadx= +{ dxT"’, F (5.48) 


The field produced by the event z(s,) propagates along the forward 
light-cone (outgoing spherical waves with velocity c). The field produced 
by the event z(so + ds) propagates in a similar light cone, so that the field 
associated with the arc AB of the world line of the particle is confined to 
the region between the two future light cones. (Fig. 10). To evaluate the 
energy and momentum radiated, we take a large three-dimensional surface 
S and calculate the energy and momentum flux across it. We take, thus, as 
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the volume v in Eq. 5.48 the region surrounded by the two light cones 
and the surface S. 


Fig. 10. The radiation produced by the particle along its trajectory 
from A to B is confined to the region between the two forward light 
cones C and C’. 


We transform the volume integral, Eq. 5.48, into a surface integral 
APt = | do,T”” (5.49) 
: Boundary 


Part of the boundary, namely the surfaces of the cones, C and C’, does 
not contribute to the integral, Eq. 5.49. For T”” is proportional to (x — z)” 
and along C and C’, do, is orthogonal to (x — z)’. Physically, this means 
that the energy, of course, does not cross the two cones. Therefore, the 
outward radiation of energy is given by 


APH = | do,T", (5.50) 
iS 


as we have anticipated. We have written AP” to show that this radiation is 
associated with a small segment of the motion of the particle. 

Since the choice of the surface S is arbitrary and, in particular, dif- 
ferent observers may choose different surfaces, we must first show that 
AP" defined by Eq. 5.50 is independent of the choice of the surface S. 
This will establish the four-vector character of AP” which is essential. 

For this purpose consider another surface S’ and the region v’ of the 
space-time surrounded by S, S’ and the two-light cones, C and C’. Since 
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this region of space-time does not contain the world line of the particle, 
we have the conservation law 


le: dx =0 


{ do,T*” — [ do,T” + [ ao,7” | do,T” =0. 
s < Cc e 


Again the last two integrals are zero! and we have 


or 


[ do,T* = [ da,T*”, (5.51) 
Ss Ss’ 


i.e., the energy momentum crossing the surface S’ will also cross S. AP# 
being independent of the choice of the three-dimensional surface S, isa 
four-vector. 

The surface S is space-like; its normal is in the direction of the space- 
component of (x — z) according to the decomposition 


(x -—z)=Ri+y. 
Thus, we write the surface element on S as 
do, = y, dsRdQ; 


In the rest frame S can be taken to be a sphere of radius R; dQ is the solid 
angle. We have then 
AP* = —{ dsRdQy,T", 
or with Eq. 5.46, 
] 


2 
AP* = —(e/4n)? ds { dQ RB y(R2" + y*\(R2" + y’) & + #|| : 


This equation simplifies by noting that 


yz”=0 and y,y’= —R?, 


AP* = —(e/4n)? ds | a (Rz" + y*) (2 + 3) 


Now the integral J dQy* ‘++ is Zero, since y is orthogonal to the surface. 
We further put 


Q? = (y%Z,)? = y?z? cos? 8 = —R2z? cos? 0, 


1 These integrals are not zero if we take the total field rather than the far field, 
But the contributions, of course, vanish at large distances. 
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where @ is just the azimuth of dQ. Then 


AP" = —(e/4n)? ds J dQz"27(1 — cos? 0) 
or 


(5.52) 


This is the general covariant formula for the energy and momentum four 
vector radiated per unit proper time. 
Writing 


, at 

a= 7, (oy) 
cipal dt . _ av 
2-2/5 (0, ¥) + (¢,v) — a(5 Fale eer 
oS 


4 2 4 8 
Zz =~_— Les (7 ae 2(2 ri] ) ae a (v-4)?0?) ae c2 (*) (v-¥)?, 
c c c c 


and 


it 2 
a = (e7/42)3(z")(y/c)* ti + ¢ vi) (5.53) 
which, in the nonrelativistic limit, becomes 
AP®. 4 2” 
— = -- 5.54 
= (€*/4n) 5 (5.54) 
AP = (2/40); Me (5.55) 


with 


& (2) =c1erv-s 


‘ (y2/e2) = Ay/o)4 v-¥ 


2 
- (y/o = c i E +vev+3 (:) co] 
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For a finite interval of the world line AB the total radiation is given by 
B 

Pes — | (e?/4n) 3242? (5.56) 
A 


This formula assumes that the trajectory z“(s) is given, then P“ may be 
evaluated. However, strictly speaking, z“(s) cannot be given a priori, 
because due to radiation lost the particle must change its acceleration. 


5. Radiation Reaction 


(A) Energy Balance 


The radiation field of the particle changes the total field which acceler- 
ates the particle. Therefore, the motion of the particle changes. This is a 
type of “feedback” process. Another way of expressing this process is 
that with the act of radiation (or energy loss) the particle has to de- 
accelerate simply by energy balance. 

In the light of this phenomenon the equations of the motion of a 
particle discussed in Part I, Chapter II, are only approximately correct, 
for we have treated the external fields and therefore the forces on the 
particle as given. One way to include the radiation reaction into the 
equation of motion is to consider the energy loss caused by the radiation 
as the result of an additional (fictitious) force on the particle, the so-called 
reactive force or self force, K,excy. The Minkowski Eq. 2.10 will now be 
generalized into 

P= (moet) = Keg + Kreact « (5.57) 


It is instructive to start the discussion with a simplified and heuristic 
derivation of K,.,., in electrodynamics based on the idea of energy 
balance. 

We consider a one-dimensional radiating motion. The energy lost by 
the particle in a time interval T must be equal to the work done by the 
self-force. In the non-relativistic limit we have then 


Plone 


1 
way a. 2:2 es J 
af [ 3 4nc3 sai T | K ett X dt. (5.58) 


t 


1 This derivation goes back to Planck, Abraham and Lorentz. Fora historical review 
and a long list of references for the subject matter of this section see T. Erber, Fortschritte 
der Physik, 9, (1961), 343. 
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If we integrate by parts the left-hand side and choose a T such that x 
vanishes at both ends of the integral, we get 


“G e2 - e ‘ g 
: 3 4nc3 x ah t=0. 

One possible, but not necessarily the only, solution of this equation is 
obtained by putting the integrand equal to zero. Hence we find 


R. (5.59) 


The assumption made above that ¥ vanishes-at ¢ and ¢ + T is certainly 
true for oscillatory motion, but we expect that a result like Eq. 5.59 should 
be quite generally true. 

The form of the self force is rather unusual; it is proportional to the 
third derivative of the position x(t). Indeed, the radiation reaction is an 
unusual phenomenon. Accordingly, the equation of motion is a third 
order differential equation 


m& = F(x) + He2/4nc?)x, (5.60) 


and we need three initial conditions to specify a unique solution. We shall 
discuss the solutions of Eq. 5.60 in the next section. 

If we use the relativistic formula, Eq. 5.53, for the radiation energy, 
the form of the self-force is 


Keact = 3(e?/4nc3)y? f a (?) ‘(owe 3(2) ‘WV af 3(2) 0" 
(5.61) 


or, in covariant form 


a ae 
Kiec = 3 geal + 2Z J : (5.61') 


where the dot is now differentiation with respect to the proper time s. 
Equation 5.61 is, indeed, the unique covariant form of Eq. 5.59.1 

The above treatment is, as we had said, heuristic and phenomeno- 
logical. Even in the derivation of the radiation in Section 3 we have 
assumed the motion or the world line of the particle as given. The correct 
treatment of radiation and radiation reaction must consider both the 
particle and the field simultaneously. This will give us also an insight into 
the mechanism of the radiation reaction. 


1M. von Laue, Ann. Phys., 28, (1909), 436. 
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(B) Interacting Fields and Particles 


Let Fi*(x) be the given free external field which starts the motion and 
F** the self field of the particle. The total Lagrangian of a single charged 
particle interacting with these fields is 


‘ e : 
L = mocv? + }(F” + F’)\(F,, + Fit) + «ase, — z)(A" + At, 
(5.62) 


where in the last interaction term A” and A¥%, are the potentials corre- 
_ sponding to F*’ and F#*, and in the expression for current density, Eq. 4.14, 
we have replaced 6(x — z) by an invariant density of the charge p(x — z) 
for reasons which will be explained below. At any rate, in all the following 
expressions p(x — z) will be replaced by 6(x — z) to get the results for a 
point particle. 

If we now vary in Eq. 5.62 A, and A%, independently, we obtain 

(a) the equation of the motion for the particle 


mocé, = : | ds2"p(x — z)LFtt + Fin (5.63) 
(b) the equation of the motion for the self-field 
eee : | dsp(x — z)3# (5.64) 
and (c) the equation of the motion for the external field 
pe = | dsp(x — z)" x 0. (5.65) 


The fields F*” and F¢¥ satisfy different boundary conditions, in particular 
the external field usually is so strong that the inhomogeneous term in 
Eq. 5.65 can be neglected, so that we need only to consider the coupled 
Eqs. -5.63 and 5.64. 

In Eq. 5.63 the second term on the right-hand side is the usual external 
force. The first term represents the self-force. For point particles we see 
that to evaluate the self-force we need the self-field on the world line itself, 
It is easy to see that the self-field evaluated on the world-line itself is 
infinite. For example, we may consider the retarded field of the particle 


re 1 , re ’ 4 , , 
As =1 | avo {(e — x/),(x") dx 


or the corresponding F4”,. Since the function D**'(x — z) has a 6-function 
form, the field F4Y, evaluated at x = z is infinite; similarly F*” 


ret* 
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This may seem to contradict the earlier phenomenological result where 
we have found a finite and reasonable expression for the self force. When 
we look closer, however, we see that the two procedures are different. 
Whereas in the argument based on the energy balance we have only used 
the radiation on the far field of the particle, the self-force in Eq. 5.63 
contains the total field produced by the point particle. It is thus clear that 
even for a point particle the self-force has a finite, physically meaningful 
part associated with the radiation field, and an infinite part associated 
with the near field and the mass point character of the particle. 


(C) Finite Part of Self-Force 


It is indeed possible to extract, in a covariant manner, the finite part 
of the self-force associated with the radiation field and see the nature of 
the infinite part. For this purpose we write’ 


Fret = SFr + (Fi + Fas’) (5.66) 
where we have defined the radiation field by 
Eel iii py. (5.67) 
or, equivalently, 
pes ics ee (5.68) 


It will be calculated from the potential 


rad. __-s gout _—s gin 
A; ron A, 


or 
1 
Alt = = {o" — D**”) j,(x') dx’ = : | D(x — x')j,(x’) dx’ (5.68') 
c 
and represents the total change in the external field between ¢ = — o0(F;,) 


and t= +0(F,,,)- 

It will now be shown that the self-force corresponding to the term 
4F™4 in Eq. 5.66 is finite and agrees with the result, Eq. 5.61.2 The 
infinity is therefore due to the term 4(Fiy + F. 44°) in Eq. 5.66. We first 
calculate F7** by a procedure analogous to the calculation of the total field 


in Sect. 1 of the present Chapter. 


1 The splitting in Eq. 5.66 is not unique; more generally one can put 
Fyyte = k(Fret —_ Fav) + a = k)Fret + kFs4v, 


This, however, does not change the essential result to be derived. ' 
2p. A. M. Dirac, Proc. Roy. Soc., A 167, (1938), 148; P. A. M. Dirac, Ann. Institut 


Poincaré 9, (1938), 13. 
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From Eq. 5.68 we get 


Fuge | dsD(x — z(s)) — 7 = oe 


where, this is the difference from the previous calculation, the homo- 
geneous Green’s function is given by 


= (uory)| (5.69) 


D(x) = = 6(x?)e(x°). 


Actually, we must evaluate, for a point particle, F‘34 at x = z(s). Let z(s’) 
- be a fixed point on the world line and set 


s=s'+u., 
Then 
permenant Lae 
2(s) —2(8') Sitch Sct F eaghetine 
and 


: : Ss 
as)=2+uz+— 2+ ie 
Here all derivatives z, 2, Z, --- refer to the fixed point s’. Hence 
(z(s) — z(s’))? = u? + O(u*) 
and 
(2(s") — 2(s))"2, = —u + O(u) 


since 2? = 1. Furthermore, z°(s) — z°(s’) has the same sign as u (2 is time- 
like) which implies that 


e(z(s’) — z(s)) = —e(u). 
If we insert these expansions into Eq. 5.69 at x = z(s’) we get 
+a d 
Ftd NN Ge £ 2, a 
mls!) = = | __ dueu)s(u?) 


The u? 
uz, ian Pa 5 Zy2y + 5 Zety +E 2y2y _ nor). (5.70) 


If we use the identity! 
e(u)5(u2) = —8'(u). 


1 To prove this identity we write 


, Sia), 3 
Bu!) = lim 3(u2 — a) = im —— - as “|. >0 
then 
_ 4. f&u—a) &u+a) F 
e(w)8(u) = Tim |“ ad = — 5, 
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We can integrate Eq. 5.70 by parts and obtain 
d 
Fra ah e 2 
mi(2(s)) = = f dud(u) 5 


ee u? 

Ee + 5 22, + > ZZ) + 7a Bye — (po »| ‘ 
After differentiating with respect to u twice and doing the 6-integration, 
we see that only the third and the fourth terms in the bracket contributes. 
Also higher order terms in u give vanishing contribution and we find a 
covariant and simple expression for the radiation field. 


ra 2 e ,= oa 

Fe4(z(s)) = a= (2,2, — 2,Z,). (5.71) 

We can now use this expression to evaluate the corresponding self 
force. From Eqs. 5.66 and 5.63 we have for point particles 


Kia = Al dsz*8(x — 2(s))}F i 6.72) 


a 
= ae §(272,2, — Z,). 


If we differentiate zz, = 1 we get 2”Z, = 0 and if we differentiate once more 
we have 
2,2" = —2?. 
We have finally 
Kitae = —3(e?/4n)(Z, + 2,27). (5.73) 
The notation on the left-hand side indicates the self-force on the particle 
due to the radiation field. 

The final result, Eq. 5.73, which agrees with Eq. 5.61’, is true only 
for a point particle; for an extended particle one has to make a new cal- 
culation from Eq. 5.72 with 6(x — z) replaced by p(x — z) which has to 
be a given function. 

Equation 5.73 has another interesting aspect. If we equate ee to the 
energy momentum per unit time given to the field 


a AP 
K i seit i 7 ’ 
then 
rn 
— = —4(e?/4n)i"2? — ¥(e?/4n)2". 


The first term is exactly the radiation energy momentum, Eq. 5.52, which 
is positive definite and represents energy and momentum which is given 
to the field irreversibly. The second term, on the other hand, is not positive 
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definite and must be associated with energy and momentum which is 
emitted and absorbed by the particle. To agree with the energy balance 
relation Eq. 5.58, we must now show that the integral 


t+T t+T 
| Zhe. dsie -| z? ds 
fs t 


vanishes if T is such that 7 vanishes at both ends of the integral, which is 
obviously the case. The vanishing of 7 at both ends was the same assump- 
tion which led us from Eq. 5.58 to Eq. 5.59. 


(D) Mass Renormalization 


Now we turn to the infinite part of the self-force 


Rega x | ds2°5(x — z)(Fist + Fav (5.74) 
where 
Fat t= e | dior + pity S [Be — 2 Se uy] 
with 
Di = x 5((x — z)?)6(x° — 2°) (5.75) 
and 


pe - 
D* = = 5((x — z)?)6(z° — x°). 


The field, Eq. 5.74, is infinite at the point x = 2(s). Physically, we know 
that it represents the effect of the near field of the particle. In quantum 
mechanical language the particle carries with itself a cloud of photons 
which are continuously emitted and absorbed. It is, therefore, to be 
expected that this force contributes to the inertia or mass of the particle. 
The near field is always present even if there is no radiation, for example, 
for a uniformly moving particle. Thus, the experimentally measured mass 
already includes this effect. In other words, if we can show that the force 
in Eq. 5.74 has the form 

Kyser = —OmcZ, (5.76) 


then the equation of motion, Eg. 5.63, can be written with Eq. 5.66 or 
Eq. 5.73 as 
(m + dm)cz, = Fin(z)2” + 3(e?/4n)(2, + 2,22). (5.77) 


We can then put 


m+dém=m,,, (5.78) 
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and would have a meaningful, finite and complete equation of motion 
for the particle. This is the form of the so-called ‘‘mass-renormalization” 
in classical electrodynamics. Mass and charge renormalizations play an 
important role in quantum field theory. 

The significance of the procedure of mass-renormalization is that the 
original or bare mass m does not really have a physical significance; one 
always measures m + 6m = m,,,. Even if we neglect, as is done most of 
the time, the radiation effects, we cannot neglect the effect of the near 
field which results in 6m-term. 

The procedure has also allowed us to incorporate a formally infinite 
expression into a finite one, m,,,. One would be inclined to say that the 
original unobservable bare mass m is also somehow infinite due to the 
point nature of the particle so that the sum m+ 6m becomes finite.! 
Since we cannot operate with infinite quantities, we must say that we have 
reached the limitations of the theory using a point particle and in order 
to obtain finite results we must introduce a phenomenological density 
distribution p(x — z) for the particle whose real nature is perhaps outside 
the domain of classical electrodynamics. 

The nature of the mass renormalization term, even if it has been made 
finite, should be looked either in quantum effects or in general relativistic 
effects. Quantum electrodynamics, however, has not solved the problem, 
for the mass-renormalization terms there are still infinite. 

Now we must justify Eq. 5.76. In order not to operate with infinite 
quantities, we write Eq. 5.74 with p(x — z) instead of 6(x — z). It is actually 
a little simpler to go back to the Lagrangian Eq. 5.62. The mass renor- 
malization term is given by 


L= | asst — z)(Af + At”) (5.79) 
and 
(A + AM) sx <| D(x — z)z, ds 


where D is the symmetric Green’s function 


D os 4p" aE De) (5.80) 
D(x — z) = D(z — x). 


1 Clearly, mass renormalization is equivalent to throwing away the 6m-term com- 
pletely and identifying m with mexp as before. This shows the phenomenological 
character of the whole procedure. A different interpretation will be discussed in 


Chapter VI. 
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We now proceed in a similar manner as in Section I, Eq. 5.1. 
sagt + Ash) = Ay = | dsd(oe— 27), {060° ~ 29) + 002° — 2°). 
rs 


We now get contributions from both of the roots of the 6-function z(s9) 
and z(s,) (see Fig. 9). 


2 


= ms ses Zy ———— A 
A,() < 4n E _ Z)rze S=So a (x a 2)iZy = 


Thus, 
2 Su) 5 su ° 
vince as| ee $e | ole - 20). (5.81) 


~ an X — 2(S9))°Za(So) (x — 2(81))”Zo(51) 


We evaluate the two integrals separately by expanding z(s) around so and 
51, respectively. With s = sy + u we have again 


2 
2(s) = 2(so) + uZ(So) + > Hs.) + -°- 


(x ~ 2(9) = (% ~ 2665) = ua(50) +“) 


The numerator of the first term in Eq. 5.81 is of the form 1 + u?/2z-z. If 
we take terms of the order of u, the numerator is one. For a point particle, 
the Lagrangian becomes then equal to an infinite constant, we get 


L=6mc (5.82) 
with 


e? 
dm = — | duu 


which is precisely of the form of the inertial term of the Lagrangian of a 
particle. 

Dirac’ has considered a small sphere of radius a in the rest frame of 
the particle (i.e., a tube along the world line). He calculates the component 
of the tensor T*” normal to this surface, —a@~!T' ™n,, and the energy 
momentum flow out of this surface, 


[ [see —£2,(F E+ dF, ds 


neglecting terms which go to zero with a. Because this expression must 
depend only on the endpoints the integrand must be a perfect differential, 
b,. He chooses b, = kz,. This is not the only choice but the simplest one. 


1 Opus cit. above. 
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Now the mass renormalization is carried out by putting 


$e7/a — k = Mage 
and one obtains again 


a ay 
Mexp CZ, = = 2,(F ta Dasa 
= Kk a Kx (finite part) 


Historically, Lorentz' has introduced a three-dimensional extension for 
the particle as a charged sphere and calculated the radiation reaction as a 
series in powers of the radius a of the sphere. The term which remains as 
a- 0 is essentially the finite part which we have evaluated. However, a 
three-dimensional “‘size’” is not a relativistically invariant concept. 
Furthermore, such a model for a charged particle would not be stable, 
since any finite charge distribution would extend under the influence of 
external forces, unless some additional, say mechanical, forces are intro- 
duced to hold the charge distribution together. 

It is instructive to develop also an approximative noncovariant deri- 
vation of the spatial part of Eq. 5.77 using a space- and time-dependent 
charge distribution p(x, ft). We shall also consider at the end a spherical 
charge distribution. 

We start from the expression 


: ix) 
ret adv) _. dx’ B 
HAS + An’) =| * Gaix — x" 


x [6(x° — x — jx — x’) + 6(x® — x® + [x — x’). 


The dx°’-integration can be carried out giving 


ax’ 
ee Ae alice ~—x| 
x LIX’, x° — |x — x’) + fx’, x° + |x — x’})]. 
We now expand j,(x° + |x — x’|) around the time x°. 
1 aan 


ret adv 
ee = oe Ix — x’| 


x [2 1) + (=x Se 4 ax — wt Te 4 i 


1H. A. Lorentz, The Theory of Electrons (Dover Publ., New York). 
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The corresponding field is then 


j,O,R — j,0,R 
Ix = x’ |? 


ret adv ri ’ dig ae ~~ 
srg + ryt) = [ax [an he 


|x 


1e ie | ae 
+ 573 Bt vOuR — JuOvR) + 33 53 Guan — Suv) + °° (5. ) 


where we have put R = |x — x’ |. 
The electric field is given by 


E= —4(For + For’) = — 7 cR Ot R29 22 oF 


J ( il oj VR. VR ajo ) 
a? x eee : 
4nc 


or, noting that j”; (cp, pv) we obtain 


1 5./( 1 aeeR Tey ae (x — x’) d?p .) 
a ~ 4ne ad (Rr Vee CR’ Ge ee 2cR Ot? : 
(5.84) 


A similar expression can be written down for the magnetic field. 
The contribution of E to the self-force can be calculated from the 
equation 
F = [ p(x xQEG, x) ax: (5.85) 


In evaluating this integral we see that due to the equation of continuity 


the first term of Eq. 5.84 does not contribute to F. Indeed, if we integrate 
by parts we get 


Sead x wpa. PA ba aces 3ur (Xia 
a RY o v| @xor-v(z) = v | @xon rue 


the integrand of Eq. 5.85 corresponding to this term is odd under the 
exchange of x and x’, hence the double integral 


(x — x’) 


a =. 


| | d>x d?x'p(x)p(x')v- 


For the same reason the third term of Eq. 5.84 does not contribute to F 
either. In the last term we make the following steps: 
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(x —x’) 0 dp ,(x—x’) a 
[ox®or ce — | d’x’ —_— — V-(pv) 


=  [ xpy V as) 

o ia. 
= 5 | @xpv (5 —(x-x’) S = ,) 
=[ax(2 = x me pv (x — x). 


The first term of this expression combines with the second term of Eq. 5.84 
and we finally get the electric force in the form 


- 1 , P(X, x°)p(x’, x°) [, Xx’ 
= ~ gin | Peas ————— [i+ =x) | 
(5.86) 


If the dependerve of p on x is spherically symmetric, the second integral 
is 1/3 of the first integral, therefore 


= 2 6 3 Sg p(x, t)p(x', t) 
F= ie ax dx = (5.87) 


In terms of the electrostatic energy of the charge distribution 


_ 1 p& DK.) 13 03, 
ve 5 [ae arg d?x d3x (5.88) 
we have finally 
a - 
Bee See One. (5. ) 


For a spherical charge distribution of radius a, Vo = e?/a hence 6m is 
finite and equal to 
Quer 1 
~ 3 2nc? a’ 
This term becomes infinite as a — 0. There are other terms in the expansion 
whose contribution to dm vanish as a0. We shall come back to the 
problem of mass renormalization once more at the end of Chapter VI. 


ém 


6. Equations of Motion with Radiation Reaction 


Assuming that a mass renormalization has been carried out, the 
Lorentz-Dirac equation of motion of a charged particle interacting with 
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an external and with its own field is 


s e extsv 2 e = 5 2 5,90 
mez, =~ Fyyz +5 ig an + 2:2) (5.90) 


with the subsidiary condition 
me = 1, 


In the present section we shall consider solutions and interpretation of 
this equation. 

We note that the solution of Eq. 5.90 is the actually observed motion. 
We can therefore ask first the simple question when a uniform or a 
uniformly accelerated motion can be observed. To obtain # = 0 we must 
have Fi*'=0, but the converse is not true. In the absence of external 
fields the equation of the motion 


Z, = 1(Z, + 2,27) (5.91) 
2 (e?/4n ; 
=3 (4) (5.91’) 


admits solutions other than the physically reasonable one, namely Z, = 0. 
Indeed, if we multiply both sides of Eq. 5.91 with Z, we obtain 
Oe ee ae 
- (Z,,2") = 2,2" = ads (Z,2") 
which has the solution 
2? = c7e78/t, (5.92) 


Similarly, multiplying both sides of Eq. 5.91 with z, and antisymmetrizing 
we find 


1 d 
F SA 2y— 225) = (2,2, — Z,Z,) = — (@,2,— "Ze 


ds 
the solution of which is 
Za, — yee agcl (5.93) 
Let us choose a frame such that at the proper time s = 0, 
2(0) = (1, 0, 0, 0), 2(0) = (0, C, 0, 0), 


i.e., acceleration in the rest frame at s = O is in the direction of the x’ -axis, 
then it follows from Eq. 5.93 that 


Kio => —Ko, = G, all other K,, = 0, 
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and we can integrate Eq. 5.93. The result is 
Zo = cosh[t(e*/* — 1)] 


Zz, = sinh[x(e"* — 1)] (5.94) 


al) 

Za— 0, 
This solution is clearly unphysical, because the velocity of the free particle 
tends ultimately to the velocity of light. These are the so-called “runaway 
solutions,” or “‘self accelerated solutions’’. 

The equation of the motion, Eq. 5.90, as it stands, is therefore incom- 
plete and must be supplemented by some additional condition to eliminate 
the unphysical solutions, if we accept that, within classical limits, it gives 
us a closed description of the motion. 

For the case of the free particle discussed above the solution of the 
difficulty is immediate. The equation of the motion, Eq. 5.91, is of third 
order and we need three initial conditions at s = 0, namely z(0), 2(0) and 
2(0). In the above runaway solution we have assumed an initial accelera- 
tion 2(0)' = C which we were not allowed to do. If we therefore put C = 0, 
we are left with only the physical solution z= 0 or z = (1, 0, 0, 0) for the 
particle. 

One should have, however, a general physical prescription to choose 
this third constant of the motion in more complicated cases. In essence 
the particle should not accelerate over long time intervals more than the 
external forces are capable of doing it by energy conservation. A general 
method of incorporating this condition, or equivalently, of choosing the 
third constant of the motion is obtained, if Eq. 5.90 is transformed into a 
set of integro-differential equations of motion. 

Let us write the first term of Eq. 5.90 as a general time dependent 
force term, K,(s). It is actually in most cases velocity dependent and there- 
fore unknown. We consider cases where K,(s) considered as a function 
of s is bounded, even at s = +00. We multiply the equation of the motion 


= _ 2° 
Z,—1Z,=K, +122, 


by the integrating factor e” “* and obtain the equation 
d 4 wie 
—? Gis ieee t?"2,) 
which we can integrate, so that 


« eZ, = -| ds’ ee As) db 1z7(s')z,(s’)] se tZ,(0). (5.95) 
0 
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If we require that the increase of the acceleration is less than exponential 
(for the type of bounded forces considered), the left-hand side of Eq. 5.95 
will approach zero as s— oo. Hence we can determine the integration 
constant to be 


ZAO)ee = [as e-*'"TK,(s’) + 127(s')z,(s’)]. 


Because of the assumed boundedness of K,(s’), this integral will exist. 
We obtain so the desired integro-differential equation 


s/t fo 
2,(s) = — | ds' e~*"[K,(s’) + t2°(s'),(s’)] (5.96) 


with t given by Eq. 5.91’. Clearly every solution of Eq. 5.96 is also a 
solution of Eq. 5.90; it suffices simply to differentiate this equation if the 
integral exists. But the converse is not true, because we have required the 
boundedness of K, and, asymptotically, an increase of Z, less than the 
exponential. For example, the runaway solution, Eq. 5.94, for the free 
particle does not satisfy the integro-differential Eq. 5.96; the integral 
blows up. 

Equation 5.96 is of second order; we need only the initial position 
and velocity as the third constant of the motion 2,(0) has been determined 
by the asymptotic condition. 

We rewrite Eq. 5.96 by a change of variable s’ > 5’ + 5 


1 | ; 
Z,(s) = { ds e™* es: +s) + 727(s’ + s)z,(s’ + s)] 
0 


= [a e-"[K,(s + ta) + t27(s + ta)z,(s + ta)] (5.97) 


0) 


Although the solution of the integro-differential Eq. 5.96 are uniquely 
determined by initial position and velocity, this equation is still quite 
different from the Newton equations. The causality in the usual sense does 
not hold: not only is the acceleration at time s determined by all the future 
motion, but also the motion starts before the action of the force. For 
example, if the force is zero for s <5 , the acceleration of the particle 
is different from zero for s < So. But the time constant for these pheno- 
mena as given by the exponential factor e~5/* is very small. For electron 
t = 0.62 x 1072? sec. The classical picture surely breaks down in what 
happens over such small time intervals (think of uncertainty principle). 
Even if it did not, these phenomena cannot be observed at the present 
time. At any rate the microcausality would no longer be valid, but the 
macrocausality, the observable causality, would still be valid. 
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Some explite solutions of Eqs. 5.90 and 5.96 have been obtained 
recently by Plass.' (See also Exercise 18.) 


7. Theory of the Electromagnetic Mass 


Here we present a different interpretation of the equation 
m+ 6m = Mey (5.98) 


which goes back to Thomson, Lorentz, Abraham, Poincaré, etc. It is the 
hypothesis that the mass or inertia of the electron is entirely due to its 
own field; and, furthermore, that the momentum and spin of the particle 
are momentum and spin of the particle’s own field. In other words, we 
could put my = 0 and 

6M = Mey. (5.99) 
The measured mass of the particle is a result of the motion of the initially 
massless “‘particle” in an external field. Although this idea appears to be 
very attractive it is not possible, at the present time, to build a complete 
theory on this basis. Certainly the quantum effects must be taken into 
account. But even within the framework of quantum theories the nature 
of the mass of the particles remains unexplained. In the early days of 
relativistic field theories, one used the concept of ‘“‘bare mass” and “‘re- 
normalized mass,’’ but the renormalization terms are still infinite in 
quantum electrodynamics; these concepts seem to be inapplicable to 
nuclear particles. 

The hypothesis of the electromagnetic mass take a simple form for a 
particle at rest and states that the total relativistic energy (mass) of the 
particle is equal to the total energy of the fields associated with the particle. 
The latter quantity for the electromagnetic field is given by 


1 (A We he ome dr 
00 43, 4+ | p2 43 
[rz axny | Bas -(S 4 -= (5 r 


and is infinite due to the lower limit of the integral. If we assume a spherical 
extension for the particle of radius a and equate the total energy to the 
rest energy mc” we find 


oo 2 
e| or = a = mc2 (5.100) 
eS pam 
or 
2 
Geo ei0> 


1G. N. Plass, Rev. Mod. Phys., 33, (1962), 37. 
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This quantity is the classical electric radius of the electron which also 
arises in the calculation of Thomson or Compton cross-sections. ! 

This kind of argument has a certain amount of validity. If we apply 
the same reasoning to other fields, say the mesonic field of a static neutron” 
which is of the form described by the Yukawa potential 

e ur 
a g aap 
where p is the mass of the meson and g is the analogue of electric charge e, 
_ the nucleonic charge, or the so-called coupling constant of the mesonic 
field to nucleons. We can now compare the masses M and m of a nucleon 
and an electron, respectively, with the corresponding field energies.? 


| T°, Aur? dr 
Mg (5.101) 
tha | T1o!4nr? dr 


It is interesting that the limit on the right exists. We find 


al 1 ee 
M | { (4) +r as 
ae lim — -———____—__ 
ae | (e?/r?) dr 
g° g? 
= lim[(ua + 1) e"**"] ==. (5.102) 
e a0 ie 


If we equate this ratio to the experimental number 1836 we find 


gy’ 

SES = 13. . cy 

a 4 (5.103) 
The value of g?/he obtained in meson physics is very close to this number, 
715. 

A difficulty encountered in the electromagnetic mass hypothesis which 
has been frequently discussed concerns the transformation properties of 
the energy and momentum of the field of the particle. The energy and 
momentum of the field must transform like a four-vector if they have to 
represent the energy and momentum of the particle. 


1 See Exercise 11. 


F Bs Or of a proton if we neglect the electric field compared to the strong mesonic 
eld. 


3G. Stephenson, Nuovo Cim., 5, (1957), 1009. 
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Consider the frame of reference in which the particle is at rest. In this 
frame T,,, is diagonal, the energy density is 


To = 4E’, (5.104) 
the momentum density G = 0 and 
T,, = (Ei — 4E?). (5.104’) 


Let us transform into a moving frame x’ with 


gaat F 
i al + Bx*) 


| ee , 
By mt ale + Bx®) 


(5.105) 
x?” = x? 
x: 
The energy momentum tensor transform as 
Ox* Ox? 
fe ipa (5.106) 


a dling 
or in component form 


. 1 
Too = z (Too + B7T11) 


ey: 
Tio Tag (Too + T11) 
Tro = T3o = 0. 


The total energy is the integral of Too, J Too d°x. Now the volume element 
d°x also transforms 


d>x' = y d°x. 
The energy momentum in the moving frame can be defined as 
Gravina x. (5.107) 
Now a covariant definition of the energy momentum 
Ga=)'Tyalo” (5.108) 
implies necessarily that G, is a four-vector. If in addition we have 
T,;' =9, (5.109) 


i.e., free fields, { 7,,,do” is independent of the space-like surface. We can, 
therefore, take the space-like surfaces at t=0 and ¢’ =0 so that the 
covariant definition agrees with Eq. 5.107. However, at the position of 
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the particle the condition, Eq. 5.109, is not satisfied. Using Eq. 5.105, we 
obtain from Eq. 5.107 


= =| (ts a Bi) d°x 
? (5.110) 
=! leg 4 Tax: 


Using the value of the electrostatic energy of the field in the rest frame 


Go = J Too d°x =4 [ E? d®x = Uy 
we get 
= (Ti; d*x = —4U,. 


Hence, in the moving frame 


3+ p? 
Co ear 
30 =) (5.111) 
Gi; = 40" 


which do not agree with the transformation of a four-vector. There is no 
real difficulty or paradox here, because as we have mentioned above, the 
assumption, Eq. 5.109 does not hold at the position of the particle and, 
consequently, the covariant definition Eq. 5.108 is not surface inde- 
pendent and does not reduce to Eq. 5.107. 

Let us take the definition G, = { T,,do”. In order to obtain G, = mv, 
we must choose 


1 
do=n’do= - u’ do, 


i.e., a surface normal to the direction of motion. We have then 


(5.112) 
1 1 
G, = ie Tu’ do = “fn Tio do + | T,v do 
y 


so that the correct definition has additional terms which restore the correct 
transformation property of the energy momentum four-vector G,, as it 
must be because G, is a four-vector by its very definition. 
There is no sce problem with the transformation of the total charge. 
Let 
Q =f j° d*x (5.113) 


Radiation and Radiation Reaction 203 
be the total charge in the rest frame. In the moving frame 


peel 
p= oe (since j = 0) 


and again d°x' = yd°x. Hence 
Cyr x hj ax = 0. 
Of course, the covariant definition 
= j do,,j" (5.114) 


with the equation of continuity 
jf, =0 (5.115) 


implies that Q is independent of the space-like surfaces and one can choose 
surfaces at tf =O and ¢’ = 0. 


8. Further Developments and Exercises 


(1) Derive the Lienard-Wiechert Potential using the advanced Green’s 
function. 

(2) Calculate the field of a uniformly moving charged particle using 
Lienard-Wiechert potentials and compare the results with Exercise 5 in 
Chapter III. 

(3) Evaluate the potential of a nuclear particle emitting a classical 
scalar field. 

(4) Show that for a radiation field F,,F,, and F,,(x —z), are sym- 
metric in (410) and (va), respectively. 

(5) Prove Eqs. 5.18 and 5.19. 

(6) Compare the definition of the radiation field for a single particle 
as that part of the field which depends on the acceleration and which 
behaves as 1/R at infinity, F,,(Z), with the covariant definition, Eq. 5.67. 

(7) Show that the field of a uniformly moving particle approaches a 
null field in the limit of extreme high velocities. 

(8) Calculate the energy radiated per unit time and the radiation field 
of a harmonic oscillator (in the limit v/c < 1). How long will it take for 
such a classical oscillator to collapse due to the loss of energy? 


(9) Angular distribution of radiation. From the discussion preceding 
Eq. 5.13 we see that for slowly moving charges the angular distribution 
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of the radiated energy (1/R field) is proportional to sin?0, where 6 is the 
angle between the acceleration pf and the vector n of the point where one 
measures the radiation. For fast moving particles we go back to Eq. 5.11. 
The angular distribution is proportional to 


{n x [(n — 8) x BJ}? 
(1 — n-B)° 


sin? 6/(1 — p cos 6)° 


or to 


which is peaked forward in the direction @,,,, With 
1 
3B 


The greater is the velocity the smaller is 6,,,,. 


COS Omax = = (1 + 158? — 1). 


(10) Frequency spectrum of the radiation. If we examine the form of 

E(2) Eq. 5.11, taking, for example, a monochromatic oscillator v ~ cos Wot 

_and 6 ~ sin Wot, we see that due to terms of the form vi, etc., the radiation 
is no longer monochromatic. 

Also, because the radiation is peaked in the direction of motion a 
detector will receive a pulse during a time Af so that the radiation will 
contain frequencies up to 1/At. 

If we write the energy radiated per unit time and solid angle in the 
form |J(t)|? and if I(@) is the Fourier transform of I(t), then by Parseval 


theorem 
J U@I? dt =f |I(@)|? do. 


Hence we can express the total energy radiated over all times by an integral 
over the frequencies. Hence I(w) gives the frequency spectrum of the 
radiation. 


(11) Electron-photon scattering; classical limit. If the momentum 
transfer between the photon and electron is small we can treat the motion 
of the electron nonrelativistically. For photons we use a plane wave 
E = E,e*'*-°., The electric field imparts to the electron a velocity 


v=—E 
m 
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so that the energy radiated by the electron in the solid angle dQ and per 
unit time 


dP? se? = 1 Oe ns 
a agen 5 Gmelt oP 2) ane 


The differential cross-section, o(@), which is the ratio of the energy radiated 
per unit time and solid angle to the incident energy per unit and time, is 


then given by 
214 
o(6) = (“ on =) sin? 0. 


The total cross-section, 


o a (=e) _ 8, 
a nc 3S 


is called the Thompson cross-section. The quantum mechanical treatment 
of this Compton effect is given by the Klein-Nishina formula which 
agrees with the above classical result only at very small energies and angles. 


(12) Bremsstrahlung. This is the process of emission of photons or 
electromagnetic radiation when charged particles in the Coulomb field of 
other particles change their acceleration. If we can estimate the acceleration 
of the particle we can use our radiation formula to calculate the so-called 
Bremsstrahlung. 


(13) Multipole radiation. So far we have looked at the radiation field in 
Cartesian coordinates. It is also useful to introduce polar coordinates 
r, 0, @. Instead of linear momenta k and a superposition of plane waves 
e'* one obtains angular momentum variables |] and m and a superposition 
of spherical harmonics Y,,,(9, ¢), or multipole fields. 

We recall the well-known multipole expansion of an electrostatic field 


—— [225 


obtained by using the identities 


oo 
= ) +7 Pi(cosa), r>r 
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where a is the angle between x and x’, and 


P(cos a) = ——-— x Yin(O', $')Yim(9, @) 


2i+1, 
where (6, @) and (0’, d’) are the angles of x and x’. The result is 


eee inl 2) 
Mx) = 2 aa Oe 


r 


where the coefficients C,,, (multipole moments) are given in terms of the 
charge distribution p: 


Cim = j Yin(8', ¢')r'p(x’) dx’. 


The multipole moments C,,, are related to the components of the 
moments of the charge distribution in Cartesian coordinates, for we have 
also the expansion 


1 px 1 X;x 
@(x) = -— iy. Eee ig 
(x) =| (x) dx ate ei +520, a 


where p is the electric dipole moment 
p= J x’p(x’) dx’ 
and Q;; are the quadrupole moments 
ig = J (3x;x), — r’? 6,)p(x’) d5x’. 
Now we consider the radiation field of localized oscillatory charges. 
We start from 
J*(x') dx’ 


A*(x so a oe 
(2) = Abus + 4nc} |x —x’| 


5(x° — x + |x — x’). 


Assuming oscillatory charges j(x) = Jo(x) e~ *° and a gauge (0, A), we have 
eiklx— x’| 


Ao) = 2 | 400 Me ee l= a 


At large distances from the charge distribution we can expand 


Ix —x'|2=r—n-x’ 
and 
hoe 1/2 2K 
7 (-- ik)(ax) +3(5 = k(n) SF 
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The contribution of the first term in this expansion gives the electric 
dipole field, the second term the magnetic dipole and the electric quadru- 
pole fields, and so on. The vector potential for the dipole field, for example, 
is 

elk . eikr 
A(X)aipole = —— ‘) d°x = —ikp — 
(x)a pol ae Jo(x’) d°x ikp 7 
where we have used a partial integration and the equation of continuity 
V°Jo = imp. Thus 


‘ eikr 1 
Baipoie = V X Agipote = k*(n X p) a 


‘g 
i ikr 1 ik 
Eaipote = k Vx B=k(nxp)xn — + [3n(n-p) — pi(-3 = =) elke, 


outside the sources. At very large distances from the source 
elt 
2 
Baipote = K*(m X p) i, Egipote = Baipote X 0 


as it should be. It is then easy to see that the total energy radiated per unit 
time in the form of the dipole radiation is proportional to the absolute 
value square of the dipole moment p. 
Finally we discuss the separation of the radiation field in angular 
momentum variables. 
The method is similar to the electrostatic case discussed above. We 
expand the Green’s function in terms of the spherical harmonics 
elklx—x' (2) : I 
Fay = KY Aker’ VhiM( kr) YY Vin(O'O')Yin(9, 9), 4 > 
4n|x — x’| 1=0 m=-l 
where j, and h{”) are the spherical Bessel and Hankel functions respectively. 
Hence 


00 


ACs) == SHG) CinYin(Os 8) 


t= m=-lI 
with 
Cim = J Fo(x')i(kr’ VV in(8'G') ax’. 
Once the multipole moments C,,, are known from the current distribution 
Jo(x’) we can evaluate A(x) and then the fields by 


B=VxA, E=7VxB. 


Note: One can also use directly the equations satisfied by E and B in the 
source free zone 


(V2+k2)B=0, V-B=0, E=7VxB 
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or, alternatively, 


(V2+k)E=0, V-E=0, B=—-Vx E, 
and solve the Helmholz wave equation (V? + k?)E =0 in polar co- 
ordinates. The result is the same. 

A particular solution with fixed / and m is called a multipole field; the 
general form of the radiation is thus obtained as a superposition of 
multipole fields. 

We leave the details and angular distribution of multipole radiation 
as an exercise. 

(14) Discuss the field of a uniformly moving particle in a material 
media (Cerenkov Radiation). 


(15) Scattering from a bound electron and Rayleigh scattering. Consider 
for simplicity a one-dimensional harmonically bound charged particle 
interacting with an external plane electromagnetic field. The equation of 
the motion including the radiation reaction is 


2 e7/4n... 
mx = —mwex + eft Xx + eEy cos wt 
where @ is the angular frequency of the oscillator and w that of the 
external field. We assume a physically acceptable steady state solution of 


this equation of the form x ~ cos(wt + a), so that X = —w?x. Hence 
. eee. 
mx = —M@p5x — eo w°x + eE, cos wt 


The solution of this equation can be written as! 


eE,/m , 
x(t) = Re : i 
© (qo ae 
where 


_ 2e*/4n 


T= 
Bee’ 


1 Write, for example, the equation of the motion in the complex form 


mzZ = —mwo2z + zee 
Sees 


wz + eEp e-iwt 


and use the 
Z = const e-twt 
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The average acceleration square is given by 
x? = Ie E2 Bae 
Qe (on w) + T7202 
and the radiated energy per unit time 
dP®  2e7/4n1 cn w* 


dtd2 3 c 2m °(@—w) +T’o?" 


Again if we divide by the initial flux the cross-section for the process is 
given by 


w* 


ee 
B(@? — w2)? + Ta? 


In particular for free electrons and small A/w we get back the Thompson 
cross-section o; (see Exercise 11). But otherwise the cross-section shows a 
resonance at @ = Wo. The mechanism for the resonance is the “excitation” 
of the electron and increase its radiation. 

In the region w < wy, and I’ < wg (strongly bounded electron) we get 
the Rayleigh Scattering formula 


the cross-section increases very rapidly with the frequency. 

(16) Discuss the nonrelativistic limit of the equation of the motion of 
a charged particle with radiation reaction, and the nonrelativistic limit of 
the integro-differential Eq. 5.96. Show that the unique solution of 


e/t foo) F 
ie = Eipjies: (de 
t 
for given F(t) is 
t (eB co 
x(t) = 2(0) + £(0)t + | dt | a’ | F(t" + ta) eda. 
0 0 0 


(Compare Eq. 5.97.) 


(17) Damped harmonic oscillator. (Same problem as in Exercise 15.) 
The most general solution of the equation of the motion for a damped 
harmonic oscillator, but without external field, 


tx —X — w2x =0, 
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can be written as 


3 
x= Yaeae 
k= 


where the p;° are the roots of the characteristic equation 
tp’ — p*’ — wi =0. 


This root equation has one positive real and two complex conjugate roots 
with negative real parts. The real root p, 


=i t ame 2\-1 ‘ 
Pi =(A, + A_) A, = ae Foe + (279) 
leads to a runaway solution. But if we take C, = 0, the physical solution is 
x(0 
x(t) = au) e~/T sin wt 
@ 
where 
1 


2 2) 
sas Ls 2 = 
Tg) Atut Aa) aeeiaaas 


Obtain this solution from the integro-differential equation. 


(A2, — A”). 


(18) Obtain the exact solution of the one-dimension relativistic equa- 
tion of the motion with radiation reaction in terms of the external force 
which is assumed to be independent of velocities. (See G. N. Plass, Rev. 
Mod. Phys. 33, (1962) 37.) 


(19) Passage to quantum theory. The discussion of the radiation field in 
Section 2, in particular the energy Eq. 5.35, is a most natural and direct 


starting point of introducing field quantization as well as ordinary quan- 
tum mechanics. 


Let us write Eq. 5.35 for a finite volume V in the form 
P=H=2V 2 k?a(k)-a*(k) = di E,. 
The momentum of the field becomes 
G= 2 (E,/c)k. 
If we introduce the new coordinates 


Qi. = (V/e*)[a(k) + a*(k)] 


and 
Q, = P, = —iw(V/c)[a(k) — a*(K)]; wo =ck 
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we get 
H=5¥( (Pi + @7Q?) 

and the correct Hamilton’s equations with this Hamiltonian. Thus the 
radiation field in a box is a sum of a countable infinite number of oscillators 
(normal modes) each characterized by a frequency w(k) = ck, k =|k|. 
According to Planck’s hypothesis each normal mode has a definite energy 
ha. But the number of oscillators is a random variable. For example, 
in the case of thermal equilibrium, the probability of having n normal 
modes of energy fiw is exp(—nhaw/kT) and one obtains easily the Planck’s 
law of the black body radiation. (Derive this formula.) The alternate 
and more common interpretation that each oscillator has energies 
noh(n = 0, 1, 2, ---) with probability exp(—nhw/kT) is not correct from 
the point of view of field quantization. 

If we interpret each normal mode of energy hw as a relativistic particle 
of momentum hk and mass zero as far as the energy momentum properties 
are concerned, as explained in Chapter II, it is natural to ask what corre- 
sponds to the electromagnetic field and the field equations []74, =0 in 
the case of relativistic particles of nonzero mass. As discussed briefly in 
Chapter II, we introduce again the wave vector k and frequency w by the 
equations p=hk, E=haw. Then p? —(E/c)? = mc? implies for these 
matter waves the equation k? — (w/c)? = m*c?/h? and the corresponding 
wave equation for which the plane wave decomposition coincides with 
this relation between k and @ is 

(Ci? + m?c?/h) = 0, 

the Klein-Girdon equation, which is the basic equation of quantum theory 
of massive spinless particles. The nonrelativistic limit of this equation 
gives the Schrédinger equation. The one particle aspect of the Schrédinger 
equation has been very much emphasized in textbooks at the expense of 
its field aspect which in a sense is more fundamental and primary part of 
quantum theory. Historically quantum theory of fields (at least the free 
fields) actually preceded the ordinary quantum mechanics. 
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A. 


A CTION 


AT A DISTANCE 
ELECTRODYNAMICS 


So far, we have considered the field as a physical system on its own 
right interacting with the particles. When this picture is applied to the 
case of electromagnetic field interacting with point particles, we found an 
infinite self-energy, hence the necessity of mass renormalization and the 
breakdown of microcausality in the motion of charged particles when the 
radiation reaction is taken into account. 

It is also possible to eliminate the concept of field and introduce only 
action-at-a-distance interactions between the particles. A single particle, 
in this approach, does not produce a field of its own, hence has no self- 
energy. The relativistically invariant interactions between the particles is 
such that they simulate the field between them. 

To this class of theories belong the electrodynamics of Wheeler and 
Feynman which gives a physical interpretation to the finite part of the 
radiation reaction. Under this interpretation the Dirac prescription of 
calculating the finite part of self-field (Chapter 5, Sec. 4) and the resulting 
equation of the motion would be exact. 

These questions as well as a covariant treatment of the mass renor- 
malization are discussed in this chapter. 
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1. The Action Principle of Fokker-Schwarzschild-Tetrode 


We consider a number of charges e, of masses m,, « = 1, --- N inter- 
acting with each other. The interaction is introduced by means of an 
action integral W4 


W=\V fdemz2+Ye. Y eg dtdr'z,(c)2,(t')D(z,—z,) (6.1) 
a a BHa 


where t and 7’ are invariant times for the two particles and D(x) is the 
symmetric Green’s function both in z, and z, and in past and future. 


D(x) = $(D'*"(x) + D*(x)) 

D(z, — 23) = D(z, — z,). 
The only dynamical coordinates in the action W are the particle co- 
ordinates z,, Zz, °*: . The second term of W is the sum of all possible two- 
body interactions which are symmetric in a and f as it should be. This 
requirement of symmetry fixes the Green’s function employed up to a 


factor. 
The Euler Lagrange equations for the «-th particle are 

7) 7) 
_ m,Z5(s) = e,23(s) al ds' e, és.) i, = 25>) Bz 


If we consider the right-hand side of Eq. 6.3 as a force, K“, we see that it 
satisfies, as it should 


(6.2) 


Kc —Z,) (6.3) 


K*z,, = 0. 
It represents a force “‘propagating” with a velocity c, for the force at a 


proper time s on the «-particle comes from all other particles Z, at proper 
time s’ such that 


(z,(s) — Z,(s‘))? = 0. 
To see the relation of this equation with that of the field theoretical 


equation of motion we define formally as the field of the particle B at 
the point x, the quantities 


A(x) = eg J dsD(x — Z,(s))2g,(5), (6.4) 
hence the corresponding field is 


Pee, | ds(zg4(0) “ — Zp, <) D(x — z,(s)). (6.5) 


1 We take in this chapter, for simplicity, the units such that c = 1. 
H. Tetrode, Z. Phys., 10, (1922), 317. 
J. Frenkel, Z. Phys., 32, (1925), 518. 
G. N. Lewis, Nt. Acad. Sci., 12, (1926), 22. 
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Thus, in the sense of the definition, Eq. 6.4, the right-hand side of the 
equation of motion, Eq. 6.3, involves the sum of the fields of all particles B 
at the position of the particle a: 


Wye ee,zs) >, Fe (z.), (6.6) 
B 
If we define a field . 
FRe)'—= >) sF4\(z,) (6.7) 
B 
Bta 


which is, according to Eq. 6.2, a sum of retarded and advanced fields. 
We can put 


FEO.) = 4(FutArret ae Fete adv) 
also 
P= 1G, + F(z). 
With these definitions the equation of motion 


mice =e, (6.8) 


is exactly the same as that of a single particle in a given external 
field F¥, given by Eq. 6.7 and Eq. 6.5. Moreover, the field, Eq. 6.7, 
satisfies Maxwell’s equations 


(2 A4(x) = ys ey § dsz4(s)_]?D(x — zp) 
Ba 


= 2 es | dsz, 5(x — 25), 


B Fa 


(6.9) 


if we define the current density of the particle B by 
ig "(x) = eg dszh d(x — 25), 


then the right-hand side of Eq. 6.9 is the total current density of all other 
particles except a, for which it is the effective current density. The sub- 
sidiary condition is also satisfied 


ged ise 
At (x) = » es | dsze ai D(x — 2») 
+a (6.10) 


= —D(x — z,) = 0. 


s=- 


Thus we see that the present theory based on an action-at-a-distance 
action principle is almost equivalent to the Maxwell-Lorentz theory except 
that the equation of motion, Eq. 6.8, does not contain any term cotre- 
sponding to radiation reaction. As a matter of fact, there are no radiation 
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effects in the theory, but only interactions of a number of particles. Con- 
sequently, there is also no radiation reaction. The field which enters in 
Eq. 6.8 is, however, of quite a different character than the field produced 
by a particle in Maxwell-Lorentz theory. Experimentally we observe a 
radiation. This phenomenon will be interpreted in this theory by taking 
the detector of the radiation as part of the system. The particles of the 
detectors (absorber) will be affected by other particles of the system which 
is interpreted as radiation. 

Let us summarize the interpretive statements of the theory. 

(1) An accelerated point charge in otherwise particle-free space does 
not radiate. 

(2) The “‘fields” which act on a given particle arise solely from other 
particles. 

(3) These fields are represented by half the retarded plus half the 
advanced Lienard-Wiechert solutions of the Maxwell’s equations. The law 
of force so obtained is symmetric with respect to past and future (and with 
respect to interchange of particles). 

(4) To discuss the radiation of a system, we have to assume the 
existence of charged particles (absorbers) at the boundary of the volume 
in which the given system is located. The absorbers will be accelerated by 
the system which is interpreted as the radiation. The absorbers in turn 
accelerate the system which is the radiative reaction. 

To elaborate further the last point, let us write the equation of the 
motion, Eq. 6.6, in the equivalent form 
mia ett (FAN +R — Fim > © Le — Fame), 


2 
Le all g 


(6.11) 
Here the first term is the contribution of the usual retarded field of all 
other particles and represents the usual external force in Maxwell-Lorentz 
theory. The second term is the contribution of the finite part of the self 
field of the a-particle which, as we have seen, corresponds to the self force 
a la Dirac. The last term, finally, is zero, if one makes the assumption that 
all radiation emitted by ail particles is absorbed by the absorbers:! If a 
source has emitted radiation during a certain time interval which has been 
completely absorbed, then a very long time afterwards DFO is zero. 

B 


The quantity )'F{/**" is also zero at the same time because the source is 
B 


1 J. A. Wheeler, and R. P. Feynman, Rev. Mod. Phys., 17, (1945), 157; R. P. 
Feynman, Phys, Rev., 74, (1948), 939. 

For a comparison of field theory and action-at-distance theory see P. Havas, 
Phys. Rev,, 74, (1948), 456. 
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no longer emitting any radiation. Now because 
~(B)ret _ -(B)adv 
2 LFae = Fee] 


satisfies Maxwell’s equations, it vanishes at all times if it vanishes at one 
time. 

For a consistent theory one has to require further that: 

(1°) the effect of the absorber on the sources must be independent of 
the properties of the absorber, and 

(2°) this effect must be completely determined by the motion of the 
source itself; 

(3°) the absorber must exert on the source a force which is finite and 
simultaneous with the moment of acceleration and sufficient in 
magnitude to take away from the source the energy which goes 
as radiation; 

(4°) the absorber produces a field equal to }(F** — F**”) at the position 
of the sources. This field combines with the source’s $(F™ + F*4")to 
produce the observed FT. 

With this interpretation the action W gives the correct radiation re- 

action without an infinite self energy. 


2. Action Principle with Self-Energy 


The self-energy problem is easily discussed with the help of the action 
W, since it is equivalent to the usual formulation. 
We now add a self-energy term in the action W of Eq. 6.1 by writing 


W' =) m, Jf dtzi+ ; Y ese J ds ds’Z,(s)zg(s')D(z,(s) — zp(s’)). (6-12) 
@ a,p 


The action W’ differs from W by the term a = B which is infinite due to 
the D function. We therefore replace 


cme on 
D== (6x — xy’) 
by 
5 ple - x) 
a 


where p(x?) is an invariant function. Instead of Eq. 6.4 we now define a 
new field 


A(x) = Ef dsigy(s)olx — 24(5)) (6.13) 
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and obtain from Eq. 6.12 for the equation of motion of the a-th particle 


ms 


Wgeeg = faz, Px, Pz.) 
all g 


= eatt| > LO Godt Fea] (6.14) 


Ba 
— ext self 
oe ea oy Sg 


We remark that the field, Eq. 6.13, does not, strictly speaking, obey 
’ the Maxwell equations on account of p instead of 6-function density: 


- e : 
CAP) — a J dsz,,[ p(x — zp), 


unless we modify the current density slightly as one should. 
The subsidiary condition 


dZz, 


At, =z-Jds as 


0 
2n Bat PO — 20) 


= ~ Ff dolx —z,(s)) 


is satisfied if p falls off rapidly as s+ +00. One can choose p(x”), for 
example, in the following form 


1 
p(x?) = 5,3 oxP (—|x?{/a) for x?>0 
=0 formx? <0 =) 


where a is a characteristic length, say, of the order of the electron radius 
a e?/mc?, so that it behaves like a 5-function at large distances where 
Maxwell’s equations are strictly satisfied. ; 

We can write the equation of motion, Eq. 6.14, in a slightly different _ 
form by eliminating the term absorbed by the absorber: 


. y = : = 1 
Meda = Cyt Y, FO = ext] ye ae = Y (FOr — a) (6.16) 
all B all B all B 


where we have introduced 
Fret =f JF — Py 
In Eq. 6.16 the second term on the right-hand side is again zero, hence 


Mdgy eg EF, (6.17) 
all B 
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so that F'*t appears as the effective field. If we separate once more the 
damping and self-force terms, we get 


meee = ii oy Pipe a Fo at 4(Fiaret = siti 
BHa 

i 6.18 

= Ke at Koo Ket reaction, ( ) 


which is the same result as obtained in the previous section. 


3. Mass Renormalization 


We show in this section how the self energy term can be incorporated 
into the inertia or mass term in a covariant way. The term a = f in the 
action W’, Eq. 6.12, with an invariant density p has the form 


5 62 f ds ds’e(s')o(ea(5) — 24669). 6.19) 


Since p is peaked like a 6-function only the values of z,(s’) close to z,(s) 
is important, or, if we let s’ = s + u, only small values of u. We therefore 
expand z,(s + u) around s. Then 


(z,(s’) — z,(s))’ = u* + O(u*) 


uz 
z,(s’) = z,(s) + uZ,(s) + - z(s)+e°* 


and Eq. 6.19 becomes 


x e2 | ds duz,(s) [20 + uz,(s) + > Z,(s)+°° ] 


x p(u? + O(u‘)). - (6.20) 
The second term gives a vanishing contribution since z,Z, = 0. The first 
term gives 
i! +a 
— e i as| dup(u?) 
4n Bae 
or, if we introduce instead of s the arbitrary parameter 
+o 
e2 | dtz? | dup(u?) = | dt 6m,22 (6.21) 
Te —-o 


where we have defined 


+0 
eee? | Guna). (6.22) 
4x ee 
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We get thus a self-energy term which has the same form as the first term 
of Wor W’. 

For the function p given by Eq. 6.15 we find 


oA 1 +o 


- Adopting the hypothesis of the electromagnetic mass and equating 
dm = M,x,, We get, with correct units, 


1 e? 


as we have anticipated. 

The higher order terms in the expansion, Eq. 6.20, give, however, 
correction to our result that the self energy term is completely inertial. 
For, example, the third term gives 


A o2 | du puty | dst 
4n a 9) p Za s Zq by 
which, unfortunately, is not of the form of Eq. 6.21. Thus, with a covariant 


cut-off the problem of mass renormalization on classical level is not 
completely solved within this approach. 
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“_ | | we.can only hope that more such striking expositions will be written.” 
— Bulletin of the American Mathematical Society 


Students of theoretical physics interested in the classical underpinnings of modern 
quantum field theory will welcome this advanced text. It is a systematic, covariant 
treatment_of the classical theories of particle motion, fields, and the interaction of 
fields and particles. Particular attention is given to the interaction of charged particles 
with the electromagnetic field. 


The treatment throughout the book is relativistic; the author attempts, as much as 
possible, a coordinate free (or covariant) form of the equations both for particles and 
the fields. The book opens with an extensive discussion of space-time, Lorentz trans- 
formations, Lorentz-group and tensor and spinor fields. This material is essential to 
the understanding of many branches of theoretical physics, in particular relativistic 
quantum theory. Chapter II describes various relativistic forms of the fundamental 
problem of dynamics: describing the trajectories of particles for given external forces. 
The general dynamical principles to obtain the field equations and the important 
problems of the conservation laws are discussed in Chapter III. 


The second part of the book (Chaps. IV-VI) is devoted to a lucid treatment of the in- 
teractions of fields and particles. Chap. IV deals with equations of motion and their 
solutions (the so-called Cauchy problem), focusing on the solution of field equations 
with Green's Functions using Dirac formalism. The problem of feedback between par- 
ticles and fields (radiation and radiation reaction) is taken up in Chap. V, as are ques- 
tions concerning the limitations of classical field theories and classical dynamics. Dr. 
Barut concludes the book with an excellent exposition of the purely mechanical ap- 
proach to the problem of the interactions of charged particles—the so-called action- 
at-a-distance formulation of electrodynamics. 


Written ona sophisticated level suited to mature graduate students, this unique and 
important work includes many discussions of hitherto unpublished original work. In- 
deed, it is considered by many to be the first comprehensive treatment of relativistic 
electrodynamics ever written. To further assist the student, Prof. Barut (a highly dis- 
tinguished theoretical physicist now at the University of Colorado, Boulder) has in- 
cluded problems, alternate proofs, and additional topics at the end of each chapter, 
where a useful bibliography is also provided. A general bibliography appears at the 
end of the book. 


Unabridged, corrected (1980) republication of the edition published by Macmillan & 
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